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We study a model for a massive test particle in a microscopic periodic potential and 
r~| | interacting with a reservoir of light particles. In the regime considered, the fluctuations in 

the test particle's momentum due to collisions typically outweigh the shifts in momentum 
r~| . due to the periodic force, and so, the force is effectively a perturbative contribution. The 

mathematical starting point is an idealized reduced dynamics for the test particle given by 
a linear Boltzmann equation. In the limit that the mass ratio of a single reservoir particle 
to the test particle tends to zero, we show that under the standard normalizations for 
the test particle variables, there is convergence to the Ornstein-Uhlenbeck process. Our 
J> ' analysis is primarily directed towards bounding the perturbative effect of the periodic 

■ potential on the particle's momentum. 

O 

1 Introduction 

The Ornstein-Uhlenbeck process offers a homogenized picture for the motion of a massive parti- 
cle interacting with a gas of lightweight particles at fixed temperature [33] . In this description, 
the spatial degrees of freedom are driven ballistically by momentum variables which are them- 
selves governed by a diffusion equation that includes a drift term corresponding to the drag 
felt by the massive particle as it accumulates speed and has more frequent collisions with the 
gas. Under diffusive rescaling, the spatial variables converge in law to a Brownian motion. 
This result follows by an elementary analysis of the closed formulas available for the Ornstein- 
Uhlenbeck process [26]. The Brownian motion description for the test particle transport is 
effectively "more macroscopic" than the Ornstein-Uhlenbeck model, since the fluctuations in 
the particle's momentum are integrated into infinitesimal spatial "jumps" for the Brownian 
particle. 

In the other direction, we may consider derivations of the Ornstein-Uhlenbeck process from 
models which are "more microscopic". These relatively microscopic descriptions may merely 
be more complicated stochastic models for the test particle (such as a Boltzmann or linear 
Boltzmann equation), or, more fundamentally, a reduced dynamics for the test particle starting 
from a full microscopic model which includes the evolution of the degrees of freedom for the 
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gas. The stochastic model in the former case should be regarded as an intermediary picture 
between the Ornstein-Uhlenbeck and the Hamiltonian dynamics arising in some limit (see [30] 
for a discussion of the low density limit). In the Boltzmann models, the test particle undergoes 
a Markovian dynamics, whereas for the Hamiltonian model including the gas, the randomness 
is only in the initial configuration, and the resulting dynamics for the test particle given by 
integrating out the gas is non-Markovian. In the other direction, the contrast between the 
Ornstein-Uhlenbeck and the Boltzmann-type dynamics is that the momentum in the Boltzmann 
case makes discrete jumps (which are individually small in the Brownian limit) corresponding 
to collisions with gas particles rather than evolving with continuous trajectories according to a 
Langevine equation as in the Ornstein-Uhlenbeck case. We refer to the book [26J for a discussion 
of these various levels of description for a Brownian particle. 

Rigorous mathematical derivations of the Ornstein-Uhlenbeck process were achieved in [151 
[3] from stochastic models giving an effective description of the test particle receiving collisions 
from particles in a background gas. For models which begin with a full mechanical Hamiltonian 
model including the test particle and the gas, derivations of the Ornstein-Uhlenbeck process 
from the reduced dynamics of the test particle were obtained in [TBI QUI EE] • 

In this article, we consider the Brownian regime for a stochastic model in which a one- 
dimensional test particle makes jumps in momentum (interpreted as collisions with a back- 
ground gas) and is acted upon by a force from an external, spatially periodic potential field. 
Due to the presence of the field, the momentum process is no longer Markovian, since it drifts 
at a rate depending on the particle's position. The momentum of the particle has two contri- 
butions: the total displacement in momentum due to the field given by a time integral of the 
force and the sum of the momentum jumps from collisions. Due to the specific scaling regime 
considered (which includes the period length of the potential), the force field typically makes a 
smaller-scale contribution to the test particle's momentum than the fluctuations in momentum 
due to the jumps ("collisions with the gas"). The vanishing of the force contribution is an av- 
eraged effect driven by the frequent rate at which the test particle is typically passing through 
the period cells of the potential field. The Brownian limit of the model to first-order thus yields 
the same Ornstein-Uhlenbeck process as if the force were set to zero. Our analysis is focused on 
obtaining a sharp upper bound for the influence of the external potential on the momentum of 
the particle, and our techniques improve those applied to a related model in [9] . Ultimately, the 
main contributions to the total drift in momentum due to the forcing are made during "rare" 
time periods at which the test particle's momentum returns to "small" values. The results of 
this article are extended in [7] to prove that the integral of the force (or net displacement in 
momentum due to the potential) converges in law to a fractional diffusion whose rate depends 
on the amount time that the limiting Ornstein-Uhlenbeck process spends at zero momentum 
(i.e. the local time at zero). 

Our model is a linear Boltzmann dynamics for a one-dimensional particle making elastic 
collisions with the gas and including a spatially periodic potential. The jump rate kernel is 
the one-dimensional case of the formula appearing in [30| Ch. 8.6], which corresponds to a 
hard-rod interaction between the test particle and a single reservoir particle. However, since 
the model is one-dimensional, it cannot be derived from a mechanical microscopic dynamics 
in the Boltzmann-Grad limit. We thus regard our model as phenomenological, and we argue 
that the resulting behavior that we find is qualitatively the same as what should be expected 
in an analogous three-dimensional model for a Brownian particle in a one-dimensional periodic 
potential. 

We think of our model as corresponding to an experimental situation for a large atom or 
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molecule in a periodic standing-wave light field (optical lattice) and interacting with a dilute 
background gas. A periodic optical force on an atom can be produced experimentally by 
counter-propagating lasers (see, for instance, [22] or the reviews [H[25]). A classical treatment 
of the atom is reasonable in the regime where the potential is effectively weak, since the test 
particle is typically not constrained by the potential and the coherent quantum effects for the 
test particle will be suppressed by interactions with the gas. 



1.1 Model and results 

We will consider a one-dimensional particle of mass M interacting with a gas of particles with 
mass m for = A <C 1 in the presence of a force ^r(f) for some smooth V : R — > 1R + 
with period a > 0. We take the phase space density fy t ,\{x,p) at time t > to obey a linear 
Boltzmann equation 

d T , . X d . , dV ,x, d T . . 
dt % ^ P) = ~m P ^ % ^ p) + d^x) dp % ^ p) 

dp'(j x {p',p)^ tyX {x,p') - J x (p,p')^ tyX {x,p)), (1.1) 

where J x (p' ',p) is a kernel describing the rate of kicks in momentum p' — >• p for the massive 
particle due to collisions with reservoir particles. Since we are considering an ideal gas, the 
rates J x {j>' \p) will be determined by the interaction potential between the test particle and a 
reservoir particle, the temperature the ratio X = and the spatial density r/. We will 
take the rates J x {p' \p) to correspond to a hard-rod interaction (or alternatively "hard-point", 
since the dimensionality of the objects do not appear for the one-dimensional linear Boltzmann 
equation) which has the form 

„(1 + A) , ,e-M^''-^'Y 

The jump rates J x are the explicit form of those in equation (8.118) from [5U] for the dimension- 
1 case, written in momentum variables rather than velocities. 

We will denote the stochastic process whose probability density evolves according to (11. ip 
by (X t ,P t ). Let us also define the process 

* J dx ^ X ^ 

The process D t is the cumulative drift in the particle's momentum due to the periodic force 
field, and hence the momentum at time t has the form 

P t = P + D t + J t , (1.3) 

where J t is the sum of all the momentum jumps due to collisions with the gas over the time 
interval [0, t]. 

Let (qt,pt) £ K 2 be a process satisfying the Langevine equations 

dq t = —p t dt, 

m 

dp t = - lPtd t+(^) h dB' t , (1.4) 
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where 7 = 877 (^g)S and B; is a standard Brownian motion. 

Since our model is rather specific, the technical assumptions for our main results are only 
the following list. 

List 1.1. 



1. The potential V(x) = V(x + a) is continuously differentiable. 

2. The probability measure \x on IR 2 for the initial location in phase space (X , P Q ) has finite 
moments in momentum J s dfi(x,p) \p\ m < 00, m > 1. 

The following theorem gives the main result of this article. It states that as A — > the 
momentum process P* rescaled by a factor A 2 converges to an Ornstein-Uhlenbeck process. 

Theorem 1.2. Assume List \1. 11 In the limit A — > 0, there is convergence in law of the process 
AaPj. to the Ornstein-Uhlenbeck process p t over the interval [0, T}. The convergence is with 
respect to the uniform metric. Moreover, 



sup 

A<1 



E (A) 



sup \\±D 

0<t<T 



< OO. 



1.2 Further discussion 

This article concerns the dynamics of a Brownian particle which feels a light force from a 
one-dimensional periodic optical potential (optical lattice). There is considerable range in the 
period and amplitude that can be experimentally produced for an optical lattice [25], and we 
focus on a regime in which the potential is "microscopic" . By "microscopic" , we mean that the 
potential has an amplitude sup x y \V(x) — V(x')\ which is much smaller than the typical kinetic 
energy = A -1 ^ of the test particle at equilibrium with the heat bath, and that the period 
a is small enough so that the typical rate at which the particle passes through the period cells 
(a 2 M(3)~2 is much faster than the rate of energy relaxation « A _1 7 for the test particle. 

For our mathematical analysis, the force F(x) = ^r(f) is taken to have a period which 
scales proportionally to the mass ratio A = ||: aX. This is not essential to these results, and 
only the broad features described above are critical. The same can be said about the amplitude 
of the potential. 

Theorem 11.21 states that to first approximation under Brownian rescaling, the momentum is 
an Ornstein-Uhlenbeck process with no dependence on the potential. This classical treatment of 
the particle allows for comparisons with quantum models. A similar model for a one-dimensional 
quantum particle was studied in [6] for which the potential is a periodic 5-potential. In that 
case, the singular potential makes a first-order change to the dynamics characterized by spatial 
subdiffusion caused by Bragg reflections even though the periodic potential is "microscopic" in 
a similar sense as described above. See [U[T3l[T9] for examples of experimental investigations of 
Bragg reflections of atoms from optical potentials. Analogous quantum models with smoother 
potentials will behave more like their classical counterparts. 

A three-dimensional linear Boltzmann dynamics for a particle in a gas of hard spheres and 
under the influence of a one-dimensional optical lattice will have the same limit result (up to 
the constants) as in Theorem 11.21 for the degree of freedom in the direction of the potential. 
Although the momentum for a single spatial degree of freedom is not Markovian in the linear 
Boltzmann description, it becomes "more Markovian" in the Brownian limit as is seen in the 
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limiting three-dimensional Ornstein-Uhlenbeck process. The rates fl X . 2 1) can then be replaced 
by the effective rates that emerge for a single degree of freedom in the three dimensional case, 
and which have the same qualitative features for our purposes. 



1.2.1 Features of the model 

By rescaling the spatial coordinate for the particle by a factor of A -1 , the master equation (11. ip 
becomes 

|*,, A (x, p) = C- x (VM*,p) = -££vu*,p) + £(*)|»*(*.p) 



dp'{Mp',p)9,, x (x,p') - Mp,p')9,,x(x,p)), (1.5) 

where the generator C* x is defined by the second equality. We thus effectively have a particle 
with Hamiltonian H(x,p) = ^p 2 + V(x) and a A-dependent noise. Note that under the new 
spatial metric, the velocity of the test particle is — rather than = A— . For the purpose of 
Theorem 11.21 and this article generally, it is sufficient to consider the spatial degree of freedom 
to be a unit torus T = [0, 1) so that the total state space is E = T x R. The equilibrium state 
for the dynamics on S is given by the Maxwell-Boltzmann distribution 

e -(3\H(x,p) 

Voo,x(x,p)= (1.6) 

for some normalization N(X). 

After the spatial stretching, the drift process in momentum D t has the form 

D t = [ drg(X r ,P r ) (1.7) 
J o 

for g : E — > R + given by g(x,p) = ^%:{%)- It is thus an integral functional of an exponentially 
ergodic Markov process on S. Nonetheless, the central limit theorem for \±Dt stated in 
Theorem 11.21 does not follow from the limit theory for integral functionals of ergodic Markov 
processes [2U], since the relaxation to the state (jl.fip only occurs on the time scale A" 1 ^> 1. 
Indeed, there must be many collisions with reservoir particles before there is memory loss for 
the heavy particle. As explained in Sections II .2.24fT7273| the analysis of \*D± is more related 
to the limit theory for martingales whose bracket processes are additive functionals of a null- 
recurrent Markov process [17] . This is due to the fact that the fluctuations in D t accumulate 
mainly during time intervals in which \P t \ is much smaller than the typical momentum-size 

{jpY > 1 for the state ^W- 

1.2.2 Rough picture of the behavior in the Brownian regime A 1 

Since the equilibrium state of the dynamics is given by the Maxwell-Boltzmann distribu- 
tion (jl.6p . the typical energy for the particle when A 1 will be on the order A -1 . Moreover, 
the potential V(x) is bounded, so most of the energy will be in the kinetic component ^p 2 
corresponding to momenta with \p\ on the order of A~^ 3> 1. The jump rates J\{p-,p') for 
\p\ = 0(A~5) are approximately 

Jx(p,p') =j(p- p') + X P^J(P - P) + (1-8) 
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where the idealized rates j(p) have the form 



J(P) = ^"IpIt^TT- (1-9) 



2m 1 '(27rf 



P' 2 



The second term on the right is 0{\i) by our assumption \p\ = 0(A 2). The physical meaning 
behind the approximation (jl.8p is that the gas reservoir particles are typically moving at speeds 
on the order which is greater than the typical speed of the test particle (4f) 5 = A2 (?j)2 <c 

(if)^ 0- The statistics for the momentum transfers from the gas thus do not depend strongly 
on the momentum of the test particle, and have approximately a convolution form as in the 
zeroth-order term in ( tTjjJ) . The zeroth-order approximation in (jl.8p suggests that the collision 
component J t of the momentum ( II. 3p is typically behaving as an unbiased random walk with 
increments having density j(v). Based on this reasoning, A 2 Jt should converge by the central 
limit theorem to a Brownian motion with diffusion constant ^jp- as A — > 0. However, the 

first-order term in (jl.8p generates a drift for \2Jt_ which is retained as A — > and converges 
to a limit by a law of large numbers. This can be seen in the friction term appearing in the 
Langevine equation ( 11. 4ft . 

According to the heuristics above, J± should typically be found on the scale A~2 when 

A 1 and t E [0, T], and we will now argue that D± should typically be 0(A _ *). We can 
parse the integral for D t according to the collision times t n as 

JtM t n= i Jt n -1 

where to — and Mt is the number of collisions up to time t. Between collisions from the gas, 
the particle evolves deterministically according to the Hamiltonian H(x,p) = \p 2 + V(x), and 
Newton's equations give 

P t - n -P t ^= T dr%(Xr). (1.10) 

If H(X tn _ 1 , P tn _ 1 ) > 2sup x . V(x), then the momentum will not change signs over the interval 
[t n -i,t n ), and 



\Pt- ~Pt A< 



Pt n -A - JPL, + 2V{X^) - 2V{X tn ) 



< 2sup x V(x) 
\Pt„-i\ 



which follows from the conservation of energy and the quadratic formula. Thus, when |-Pt n _J 
is on the typical order oc X~^, then the increment (ll.lOp of the momentum drift is O(X^) 1. 

In fact, there is another critical feature of an ergodic nature that make the contribu- 
tions ( ll.lOp to D t even smaller when |Pt„_ 1 | ^> 1. There is an ergodicity on the spatial torus 
relating to the fact that when the momentum is high, then the particle revolves quickly around 
the torus, and its location at the time of the next collision is close to uniform. This idea can 
be used to show that the mean for J t * n i dr ^(X r ) is 0(\P tn _ 1 \~ 2 ) when given the information 
known up to time t n -2- I n other words, besides the increments (II. 1 j) just being small when 



1 These velocities refer to the original length scale, before stretching by a factor of A 
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\Pt n - 1 1 3> 1) Dt is also behaving like a martingale since the increments are close to having mean 
zero and being uncorrelated. Thus, there is a central limit theorem-like cancellation among the 
terms. This motivates that the contribution to Dt from time intervals where \P r \ oc A 



is 



o(\- 



since for fixed small e > 



Mt 



n=l 



dx 



o(e[x>(i^j>^)( f dr^-(x r : 2 

n=l Jtn-1 



dx 



< e" 2 E[AT 1 ]0(A) = 0(1), 



because the collisions occur with a frequency on the order of one per unit time E[A/i] = 0(f). 

These contributions disappear for the normalized expression X^D±. (For technical reasons, our 
analysis of these facts is actually done with a different set of artificially introduced stopping 
times rather than the collision times. See Section I4.4j) . 

The above arguments motivate that Dt spends the greater portion of the time interval 
t G [0, T] behaving as a constant, or, said differently, its larger ffuctuations-if they occur-must 
typically be concentrated on a small fraction of the interval [0, T]. Let us consider the order 
of the contributions to D t which are likely to occur for the periods of time when P r returns to 
the region around the origin, that is \P r \ = 0(1). If Pt is behaving roughly as a random walk 
for t e [0, f ] with some very weak friction, then we expect that P t spends on the order of A 2 
time in the vicinity of the origin. If there is central limit theorem-like cancellations between 
the increments J t " ^dr ^(X r ) in those time periods, then Dt should be expected to be on the 

scale A~3. 



1.2.3 Techniques and strategy of the proof 

The main difficulty in showing that X^Pt converges in law to the Ornstein-Uhlenbeck process p t 

is to show that component Dt of the momentum is typically o(A~5) for t E [0, T). As indicated 
by the heuristics of Section [1.2.2[ we should expect, in fact, that typically sup 0<t<r \Dt | is 

0(A~*). 

One of the main ingredients is a splitting technique which consists in introducing an artificial 
"atom" into the state space (if it doesn't already have one, as in our case) by embedding the 
original process as a component of a process living in an enlarged state space. In principle, 
the benefit for having an extended state space with an atom is that the trajectories for the 
process S t can be decomposed into a series of i.i.d. parts (i.e. life cycles) corresponding to time 
intervals [R n , R n +i) where R n are the return times to the atom. This would allow the integrable 
functional D t to be written as a pair of boundary terms plus a sum of i.i.d. random variables 
(with a random number of terms). For Markov chains, such a technique for embedding an atom 
was developed independently in [28] and [2], and is referred to as Nummelin splitting or merely 
splitting. When it comes to splitting a Markov process, there are different schemes available. 
In [T7] , there is a sequence of split processes constructed which contain marginal processes that 
are arbitrarily close to the original process. The construction in [21] involves a larger state 
space X x [0, 1] x S, although an exact copy of the original process is embedded as a marginal. 
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The idea that splitting constructions could be used as a tool to prove certain limit theorems 
for Markov processes was suggested in an unpublished paper |32j . 

We use a truncated version of the split process introduced in [21]. The split process is not 
Markovian itself, but contains an embedded chain (the split resolvent chain) which is Markovian. 
The life cycles for the process are not completely independent in this construction, since there 
are correlations between successive life cycles. The details of the construction are explained at 
the beginning of Section HJ and the discussion is self-contained. The original process St, that 
lives in E = T x R is embedded as a component of St = (S t , Z t ) 6 E = S x {0, 1}. The process 
D t can be written as four boundary terms plus a martingale 

D t = (Sum of boundary terms) + M t (1-H) 

n=l J Hn 

where N t is the number of returns to the atom E x 1 to have occurred before time t, and 
<K (A) : L°°(E) L°°(E) is the reduced resolvent of the backwards generator C\. The boundary 
terms are 

f^ W )-f-^ W + («<^), SBl ,-(^f)(^,. 

The interjection of the telescoping terms (9^^r) (»Sh„) removes the correlations between suc- 
cessive life cycles. The fact that the increments of M t have mean zero with respect to the 
information known up to time R n is a consequence of the splitting construction and the fact 
that the observable has mean zero in the equilibrium state ^oo,x- The process Mt is a 
martingale with respect to its own filtration, and this opens the possibility of using Doob's 
inequality to the bound M t . 

The martingale M t is a variant of the martingale M' below that is usually employed when 
studying limit theorems for integral functionals of Markov processes: 

^ = ^ % ){St) " ^ % ){So) + A ' (L12) 
The martingale M' has predictable quadratic variation 

Using the martingale ( 11.121) would require showing some decay which is uniform in A < 1 
for increments |(9 c t^ %)( x iP') ~ (p^ wnen \p\, W\ are large and \p — p'\ = 0(1). 

However, it is not clear for us how to obtain the necessary bounds on the resolvent, and our 
methods are designed for exploiting time-averaging of the process ^(X r ) as suggested by the 
heuristics in Section 11.2.21 Our technique is based on having bounds for a generalized resolvent 
U ( h X) : L°°(E) -> L°°(E) of the form 

POO 

(ul X) g)(s)=EW\ dte-ti drh ^g(S t ) 



'-jo 



S 



where h in a non-negative function with compact support, and the function g 
has the form 

JV 



gx essentially 



9x{s) 



E (A) 









dtte 



The operator arises in the study of recurrence for the Markov process and has been referred 

(A) 



to as the state-modulated resolvent [23J. Analysis of Uj. for our dynamics is contained in [8]. 



1.2.4 The unit conventions and organization of the article 

Throughout the remainder of the article, we will remove units by setting /3 — a — m — 1, and 
picking rj such that 7 = |. We assume List 11.11 in all theorems, lemmas, etc. unless otherwise 
stated. Section [2] develops preliminary bounds on the typical energy of the particle. Section |3] 
contains some analysis of the collision component Jt of the momentum. Section H] is directed 
toward proving that sup 0<t<T \D±\ is typically 0(A~*). The proof of the convergence of \?P± 
to the Ornstein-Uhlenbeck process pt for small A is found in Section |5l and Section [6] contains 
miscellaneous proofs that we regarded as mechanical or routine. 



2 Typical energy behavior 

Functions of the energy H(x,p) = \p 2 + V(x) have the advantage of being invariant under 
the Hamiltonian evolution. This makes energy related quantities a desirable starting point for 
gaining some control over the typical behavior of the dynamics. 

Define the functions A\, V\, n : T x R ->• R and £\(p) : R -> [0, 00) as 

£x(p) = I dp'J x (p,p'), 

A x (x,p) = / dp'(2^H(x,p')^ -2?H(x,p)*)Jx(p,p'), 

Va,„(x,p) = f dp'(2^H(x,p'^-2 l 2 H(x,p^-^p-) 2n jx(p,p'). 

Here, £x(p) is the escape rate from momentum p. We define A^(s) = max(±Ax(s), 0) to be 
the positive and negative parts of Ax- We will often denote Vi,a as Va- Finally, we also define 

£\,n( x iP) = / dp'\H(x,p')? - H(x,p)?\ n J x (p,p r ), 
Jr 

The proof of the proposition below is mostly mechanical and can be found in Section |6l 
Proposition 2.1. There are constants c, C n > such that for A small enough, 

1. For all (x,p) G £, K x , n ( x >P) < C »0- + Mp\T +1 - 

2. For all (x,p) E E, JC* Xn {x,p) < C n . 

3. For all (x,p) G E, V A , n (x,p) < C n (l + A|p|). 
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c 

,2 



4. For all (x,p) G £, Aj(x,p) < J+p 

5. For all (x,p) G £, c < Va(x,p). 

Lemma 12.21 states that the energy H t typically does not go above the scale A -1 over the 
time interval [0, ?]. 

Lemma 2.2. Assume that j^dfi(x,p)\p\ n < 00. For any nGN, i/iere exists a C > snc/i £/iat 



SUp (-ffr-)^ 



0<r<5 



/or allT > and A < 1 . 



Proof. We will work with the process Qt = 2 2 if t 2 . The reader should think of Qt as being 
roughly the absolute value of the momentum \P t \. If P t were a symmetric random walk making 
steps every unit of time, then the result would hold with (H t )2 replaced by \P t \ by Doob's 
inequality (supposing that the tail distribution of the jumps decays sufficiently fast). The 
situation for our jump rates should, in principle, be even more accommodating, since the jump 
rates ( 11.21) tend to drag a momentum with large absolute value down to a momentum with 
smaller absolute value. However, for the purposes of this lemma, it is useful to discard the 
term associated with these large downward jumps in the decomposition (12. 2 j) of Q t , since it is 
less analytically wieldy, and it is not helpful on the time scales ? for T fixed and A< 1. 

For technical reasons, we partition the time interval [0, y] through a sequence of incursion 
times (q' n ) into a region of "low" energy. Let ? — <r{ — 0, and define the stopping times <r n , <^ 
such that 

q' n = min{r G (q n -i, 00) | Q r < A"^}, s n = min{r G 00) | Q r > 2A"^}. 

The above definition assumes that Qo < A - ^ which is reasonable for A 1 by the locality 
assumption on the initial distribution (2) of List II. 1\ but we should take ^1 = q[ = when 
Qo > A~i 

Trivially, we have the inequality 

sup Q r < 2A"2 4- S up (Q r — Q r -) + + sup sup (Q r — Q ? „) + . (2.1) 

0<r<t 0<r<t ?„<t re[f„,< +1 At] 

The middle term on the right side bounds the over-jumps past the line 2A~5 at the start of the 
excursions into high energy. 

Let t n , n > be the collision times with t = and recall that Mt is the number of collisions 
up to time t. We can write Q t as 

Qt = Qo + m t + m' t + / drA + (S r ) - V -7777^, (2-2) 



where the process m t is defined as 

Mi 



in, 



Ms t - 
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and where m' t is the difference 



.7 = 1 v l 7 



drA+(S t ). 



The processes m t and m[ are martingales with respect to the full filtration jF t . For m t , the 
increments A n have mean zero with respect to the information T - and that M r = M r - + 1. 

i 

The process m t is a martingale, since the terms ^rzfx m the sum occur with Poisson rate 
£\(r). Moreover, the predictable quadratic variations corresponding to the martingales have 
the forms 



(m)t 



dr V\(S r 



and 







£\(S r 



The third term on the right side of (12.11) is smaller than 



sup sup (Q t - Q fn ) < sup sup 



III; + m, 



III, 



< 2 


sup 

0<i<f 


m ( 


+ 2 


sup 




< 2 


sup 


m t 


+ 2 


sup 

°<*<T 





-m^+ f drA + (S,, 
A + (S r ) X (Qr > A"?) 



2CT. 



(2.3) 



For the first inequality, we have thrown away the term — Yljli e x (s 3 ) ' s i nce ^ ^ s strictly neg- 

3 

ative. The second inequality is the triangle inequality with sup 0<s r<t \f r — f s \ < 2 sup 0<r<i 
for / = m, m', and uses the fact that Q r > A~^ during the excursion intervals [q n , q' n+1 A j). 
The third inequality is a consequence of Part (4) of Proposition 12.11 which gives a C such that 

A + (X r ,P r )x{Q r > A"*) < C—^—x(Qr > A"*) < - -4- 1 < 2CA. 

The second inequality is for A small enough so that 4sup x V(x) < A^ 1 . 
Combining (12. 2 p with (12. 3p and using the triangle inequality, then 



E< A > [ sup Q?l " < 2CT + E< A > [QJ] » + E< A > [ sup ((Q t - Q t -) + )™ 



•o<to 



2E (A) 



1 1 n 

sup m t 


' l + 2E< A > 


I >\ n 
sup m t 


L o<t<f 




L o<*<? 



(2.4) 



We will now give bounds for each of the terms on the right side. The goal is to show that 



E^ 



sup 

0<t<^ 



Q?V = 0(\-t + J2(^ X) \ sup Q? 



■-o<t<^ 



(2.5) 



where a m > 2 and the sum over m is finite. The above would imply that E^ A ^ [sup 0<i< T Q™] is 
0(\~%), which is the statement of the lemma. 



11 



For the second term on the right side of ( 12. 41) . 

E(A) [Qo] = / dfi(x,p)(p 2 + 2V{x)Y < oo, 



and the right side is finite by our assumption on the initial measure in List 11.11 For the third 
term on the right side of (12. 4p . Using that (sup m a m ) 2 < and Jensen's inequality, we have 
the first inequality below 



0<r<f 



Mt 

T 



sup i Qr - Qr pi 5 < ew r y: ((Q <m - q j+ri 2,1 = ew r y: 



e^ 



m=l 



m=l 



i 

_><] 



dtJCl 2n (X t , P t ) 2n < / tft (1 + AQ t ) 



i 

2 ii 



sup Q[ 



o<<<- 



1 

2»- 



The first equality uses that 



E 



(Qr Qr 



2(i 



J>-, AC = AC- + 1 



^A,2n(^r, -Pr) 
^A(Pr) 



and the second equality uses that the times t n occur with Poisson rate S\(P r ). The second 
inequality is from Part (2) of Proposition 12. II The right-most term above has the from of (I2.5p . 

For the fourth term on the right side of ( 12.41) . we can apply Doob's maximal inequality to 
get the first inequality below 



E< A > 



sup mt 



"0<t<- 



< 



n 



T 



n 



-EW[|m z | n ]" < C'E«[((m)T) ! ]» +C'E^\ V |A m | n 



m=l 



= C'E W 
< C"E (A) 



n_ 1 

2 



drV x (S r 
dr (1 + AQ r ) 



C"E (A) 



drV x , n (Sr, 



C"E (A) 



dr (1 + AQ r ) 



< C"{Th~^ + T" A~™) + C"T*EW \ sup Q?l 2 " + C"T^E (A) [ sup Q r t 



i 

TT3" 



The second inequality is for some C > by Rosenthal's inequality (see e.g. [TT], Lemma 2.1). 
The third inequality is Part (3) of Proposition 12.11 We have combined the constants at each 
step. Thus we have made dressed our bound in the form ( 12. 5p . 

The last term in 12.41 is bounded similarly to E^ [sup 0<t< T |m t | n ] ™. 



The following lemma gives a bound for the expected amount of time over an interval [0, 



□ 



that the process St spends with energy smaller than A 8 for g £ [0, 1). The result is similar to 
Lemma 5.1 of [B], and we modify the proof for our purposes here. 
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Lemma 2.3. Define T t = A f Q dr x(H r < eA e ) for < g < 1. For any /ixec? T > 0, inere a 
C > stzcn inai /or small enough A and a// e > A e , 

E w [Tt] < Ceh 1 ^. 

Proof. 

For 6 > 0, let 7 be the minimum of the hitting time that H t jumps above b\~ 2 and the final 
time -r. We have the following inequalities, 

E (A) [Tt] < TP (A) [ sup H r > b\~ 2 ] + E (A) [T 7 ] 

■ T 

T 

( sup^ if r ) 'J + E< A > [T 7 ] < CT 3 ^ + E< A > [T 7 ] , 



0<r<i 



A 4 

< T— TE (A) 



0<r< A 



where the second inequality is Chebyshev's and the C > in the third is from Lemma 12.21 
With the restriction e > A e , the term following the last inequality decays faster than ea A^ 2 as 
A — > 0, so we can can focus our study to E^ A ^ [T 7 ] . The energy process H t = H(X t , P t ) behaves 
as a submartingale for time periods in which H t < b\~ 2 for small enough b > 0. More precisely, 
there exists 0<6'<l,a>0 such that for all A < 1 and all (x,p) with H(x,p) < b'X~ 2 , 

Q?W) := |e| a ; p) [H t ] \ t=Q = \j djt ((p'f + V(x) -p 2 + V(x))j x {p,p>) 

= \ I dp'((p'Y-p 2 )j x (p,p')>*. (2.6) 

From (I2.6p . we have for all m > 1, H(x,p) < b'\~ 2 , A < 1 that 

qL A) (^P) = / ^'((^ (p/)2 + ^ (x) ) m " 0/ + ^D^M) 

>mg_H m -\x,p), (2.7) 

where qm (x : p) := ^E| A ^ [if ™] | t _ Q . The inequality (j2.7p follows from (I2.6p . since f(y) = \p\ m 
is convex, and thus 

f(Y 1 ) - f(Y ) > (Y, - Y )f'(Y ) 
for Yi = i(p') 2 + V{x) and Y" = + V(x). 

JA), 



Notice that the value qm (St) is the derivative of the predictable part of the semimartingale 
decomposition of the process H™. In other terms, 

Jo 

is a martingale. In addition to the lower bounds in (12. 7ft . there are upper bounds 

q^(x,p)<a m (l + H m - 1 (x,p)) (2.8) 
which hold for some constants a m and all A < 1 and (x,p) with H(x,p) < \~ 2 . 
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1 

Using the above observations, there is a useful submartingale which is "close" to H£ . Let 
the b > defining the stopping time 7 be the b' chosen to ensure the condition (12. 6p . There 
exists a c > such that for all A small enough, 

Kt = Hf + cXHf 

is a submartingale over the time interval t G [0,7]. To see that n t is a submartingale up 

to time 7, first notice that the predictable component J^drc^\S r ) in the semimartingale 

3 1 2 

decomposition of increases with rate cxif t 2 by (12. 7\\ . Moreover, the predictable component 

1 1 f 

of the semimartingale decomposition of is 2~a J drA\(X t , Pt), and the negative part of 
the function A\ satisfies the inequality A^(x,p) < CXH^(x,p) for some C > by Part (2) of 
Proposition 14.151 and the elementary inequality \p\ < 2? (x,p). Thus we can choose c = — 
to ensure that n t is a submartingale in the specified time interval. 
Set qo = = 0, and define the stopping times q n , s' n < 7 such that 

q' n = min{r G (<? n -i, 00) | # r < eA~ e }, <^ n = min{r G (<^, 00) | H r > 4eA~ e }. 

Let n 7 be the number c^'s less than 7. In other words, n 7 is one more than the number of 
up-crossings of H r from X~ g to 4X~ e which have been completed by time 7. Let n 7 be defined 

similarly as one plus the number of crossings of n t from |e2A~2 to 2e2A~2. For A < (^;) T ^, 
then we have both of the implications 



1 3 1 

H t >4eX~ e Kt > 2e^X~i and H t <eX' e «t < -e*A - *, 

and hence n 7 > n 7 . The definitions give us the almost always inequality 

nt 

T t <Xj2^n-C 

n=l 

Next, observe that 

A^EW [TJ < E^ [ y ? n - d < [nJ sup EW k n - < U < n 7 ] . (2.9) 

We now have our upper bound in terms of the expectation for the number of up-crossings n t 
and the expectation for the time of a single up-crossing q n — q' n conditioned on the event n < n t . 
By the observation above E^ A ^ [n 7 ] < E^ A ^ [n 7 ] . By the submartingale up-crossing inequality [5] , 

E (A) [nl] < ! E [ -fej- _ e = 2e"5A*EW [if) + cA#|] < 2e"h2 (e< a > [HJ * + cAE[#|]) 
2e2 A 2 — 62 |A 2 V / 

< 2e'hf (EW [Fo] + A- 1 ^) ^ + 2ce-U 1+ i (E^ [i/ ] + A-i + iTla f ) 

<4e-^fTU i ^ i , (2.10) 

where the last inequality is for A small enough. The second inequality is Jensen's, and the third 

uses that 7 < ? and the bound (12. 8p for the derivatives of the predictable components of the 

3 

semimartingales H t and if t 2 . 

11 



We now focus on the expectation of the incursions q n — q' n . Whether or not the event n < n t 
occurred will be known at time q' n , so 

sup [, n -,' n \n< nt] < sup E^ [?„ - < | P ? , ] = sup E< A > [ft] . 
By (El), then 

aE^fft] <EW[ drq[ X) (X r ,P r )] = [if a - H ] , (2.11) 
L Jo J 

where the equality holds by the optional sampling theorem, since dr q[ x \x r , P r ) is the pre- 
dictable part of the semimartingale decomposition for H t — H . This application of the optional 
sampling theorem is legal, since ft is almost surely finite, and 

E^ A) [H rX {r < ft)] < A^p( A) [r < ft] — >0 as r — ► oo. 

Continuing with the right side of (12. lip . 

E?> [H qi - H ] = E W [H ( - - H ] + \E^ [P 2 - P 2 _] 

< 2e\~ s + 2E< A) [A 2 ] = 2e\- Q + 0(A°) < ceA _e . 

where A = P fl — P f -. For the first inequality, we have used that P 2 _ < H q - < e\~ Q , and the 
inequality (x + y) 2 < 2(x 2 + y 2 ). The last inequality holds for some c > by our restriction 
e > X e . The term E^fA 2 ] is uniformly bounded for A < 1, since by nested conditional 
expectations Ei A) [A 2 ] = Ei A) [EW[A 2 | P f -]] and 

E (A)[ A 2 | J ] _ <±_ 

^(X_y)>eA-,^'^(P fl -,P') 

/ff(a, P O> £ A-0 - P) 2 AM) 
< sup sup < OO. 

A<1 H(x,p)<e\-e J H(x,p')>e\-e d P -MP, P ) 

The equality relies on the strong Markov property, since the distribution for A is independent 
of P^- .when given S q -. The last expression is finite by Part (7) of Proposition 13.11 

Putting our results for E( A ) [n 7 ] and sup neN E^ A ^ k„ - u n < n 7 ] together, 



E (A) [T 7 ] < AE (A) [n 7 ] sup E (A) [?„ - q' n \ n < n 7 ] < Ace^T^X 



This completes the proof. 



□ 



The process A t = J Q dr A\(S r ) is the predictable part of the semimartingale decomposition 

of Q t = 2*Hf. Moreover, Af = f*drA$(S r ) are the increasing/decreasing parts in the Jordan 
decomposition of A t . The martingale part M$ of the semimartingale decomposition of Q 4 has 
predictable quadratic variation 

(M) t = / drV x (S r ). 
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Lemma 2.4. There exists C,c> such that for all t G [0, ?], A < 1 and s in a given compact 
subset of S, 

E< A) [A+] > -C + cti 

Proof. Since Q t and are positive and A/~ > is increasing, we have the following deter- 
ministic inequalities 

A+ = Q t - Q - M t + A" > -Q + sup -M r > -Q + M". 

0<r<t 

Taking the expectation of both side gives. 

E< a) [Ha+] > -2h-*H*(s) +t~5E( A )[M f -]. 
Since M t has mean zero, we have the equality below 

,(A) 



2EW[M t -l =EW[|M t |] > Es Q / |Mt|2] 



and the inequality is Cauchy-Schwarz. However, 



Ei A )[|M t | 2 ] =EWf / drV A (S r 



> ct, 



where c > is from Part (5) of Proposition l2TTl For the first inequality below, we use Rosenthal's 
inequality (see e.g. [H], Lemma 2.1) to produce a C > such that 



EW [|M,| 3 ] < CTEW [<M t )t] + CE^ 

n=l 

A/i 



M, - M, 



CE< A > [<M t > f ] + CE?> [ ]T | Q tn - Q t - 1 3 ] = CE^ [ ( f dr V> 

n=l jQ 



{S r 



dr JC x ,3{S r 



< C"E (A) 



dr (1 + AQ r 



dr (1 + AQ r 



<C"ti. 



where t n are the collision times and At is the number of collisions up to time t. The first 
equality uses that Q t and M t differ by a continuous process and thus have the same jumps. 
The second inequality is for some C > by Part (1) and (3) of Proposition 12.11 along with 
the relation \p\ < 2^ H^i^x.p). The last inequality is by Lemma I2T2"} and C" is independent of 
A < 1. 

Putting our results together 



,i Ei A) [|M t | 2 ] 



Ei A) [A+] > -2?m{s) 



c 2 l 



e! a) [|m 4 |3] 



>-2>H*(s) + —t*. 



This gives the result. 



□ 
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3 Limiting behavior for the jump process 

In this section, we push the study of the jump processes J t in ( 11.30 as far as we can without 
any analysis of the drift D t . J t can be written as a sum of a martingale M t and a predictable 
part as 

Jt = M t + I drV x (P r ) for V x (p) = [ dp' (p' - p)J x (p,p'). 
Jo Jr 

In order to write an expression for the predictable quadratic variation (M) t in terms of the 
jump rates J X) let us define for m e M + , 



IWp)= / dp'{p' -p) m J x {p,p'), and Q x (p) = n A , 2 (p) 



Up) ' 



Note that £ x , V x are just special cases of IT Am for m = and m — 1, respectively. The 
predictable quadratic variation for M t can then be written in the closed form 

(M) t = f drQ x {P r ). 
Jo 

The proof of the following proposition is also mostly mechanical and can be found in Section 

El 

Proposition 3.1. There are constants C, C m , c > such that for all A < 1 and all p £i 
Kfey < &(P) < 8(aTT) (1 + Cmin(A|p|, AV)) and A|p| < C£ A (p). 

*■ I^(p) + 2(awI<caV- 

3. \V x (p) + j$£ x (p)\<C 

I |Qa(p)-(XTIf| <Cmin(A|p|,AV)- 
5. <c< Q A (p) < C£ A (p). 

ft n Ai2m ( P )<c m (i + Ab|) 2m+1 . 

7. 

! W \> b dp'(\p'\-b) m J X {pyP') 

SUp SUp -. r < C m , 

A<l|p|<6< A -i J\ p '\>b d P'^{P^P') 

f HS d P >(\ P '\-i) m J X ( P , P >) 

sup sup — — < C m . 

a<i | P |> i J h i > i ] dp'J x {p,p') 

Next, we show that the drift rate T> x (p) can be effectively replaced by the linear form — |Ap. 
Let P[ be the solution to the integral equation 



P/ = P + J dr —(X r ) - -A y dr P; + M t 



(3.1) 



Also let us denote P t W = X^P[ X) and P t W '' = X^P[. 

A A 
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Lemma 3.2. For all fixed T > , and as A — > 0, the difference between P^'' and P^' 1 is small 
in the sense 



EW[ sup \P t {x) "-P t W \] = 0(X^ 

0<t<T 

Proof. Since Pt satisfies the integral equation 

r* dv r f 

Pt = P + dr—(X r )- / drV x (P r ) + 
Jo dx jo 

and P[ satisfies (13. ip . we have the identity 

p W,/_pW = f t drRx ( P W^ + I [ dre*W [ dsR x (P^')= [ dr e^R x (P r W>'), 
Jo 2 J J J 

where R\(p) = X~^ D\(- £ r) + ~p. Thus, we have the first inequality below 



sup IPW'-PWI 



0<t<T 



< 2(ei T - 1)E W 



< 2X^Ce^ T E^ x) 



sup I A 2 Pt 



0<t<T 



(A), 

2XCe^ T E^ 



sup \R\(P t 

0<t<T 



sup A 2 Pi 

LQ<t<T A 



0(A*). (3.2) 



The second inequality follows by Part (2) of Proposition 13.11 By bounding \p r \ < 2^H r 2 and 
applying Lemma [2^2] the expectations on the right side are uniformly bounded for A < 1. Thus, 
the left side is O(Aa). 

□ 

The following lemma gives a central limit theorem for the martingale M$ = X^M±. 

Lemma 3.3. As A — > 0, the martingale M t = A^Aft converges in law with respect to the 
uniform metric to a standard Brownian motion B' over the interval t G [0,T]. 

Proof. To prove the central limit theorem, we prove the following: 

(i). For each t G M + , the predictable quadratic variation process (M^) t converges in proba- 
bility to t as A —7- 0. 



(ii). For any e > 0, then as A — > 

p(A) 



SUp (M r - M r J) 2 > ~ 

A 



0<r< 



0. 



By [29l Thm. VIII. 2. 13] this is sufficient to prove that M t (A) converges in law to a Brownian 
motion. 

(i) We prove a somewhat stronger statement. Note that 

„* 

(M^) t -t = X I dr(Q x (P r )-l). 
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For the expectation of the supremum of the difference between (M^) t and t over the interval 
[0, T], we have 



E< A > 



sup 

te[o,T] 



(M™) t -t 



< AE (A) 



dr\Q x {P r )-l\ 



< CT\*(\* +E (A) 



0{K 



sup A 2 |P r 



0<r<; 



sup A|P r 



0<r<- 



where the second inequality is for some C > by Part (4) of Proposition ^. II The expectations 

in the second line above are bounded uniformly for A < 1 by Lemma [2.21 since \P r \ < 2^H r 2 . 
The above implies that (M^)j converges in probability to t. 

(ii). Recall that AC is the number of collisions over the time interval [0, t] and that t±, . . . , tj^ t 
are the corresponding jump-times. 



A 



PW [ suP t (M r - M r -) 2 > £] < I ( £ (M tn - M t -J 



4 \ a 



n=l 



T 



n=l 
Nt 

T 



E (A) ^ E (A) (M r - M r -) | P r - , AC = AC- + 1 



n=l 

N't 

t 



e £ A (P t - 

n=l L ™ 



r=t„ 



drU XA (P r ) 2 . (3.3) 



The second inequality is Jensen's, and the first inequality is Chebyshev's followed by the ele- 
mentary relation 



sup a m < ( ^ 



l<m<n 



a n > 0. 



The second equality uses that a jump for M r is a jump for P t ( since they differ by a continuous 
quantity) and that the conditional expectation for [P r — P r -) 4 given the value P r - and the 
information that r is a jump-time can be written in terms of U\\P r -) and £\{P r -) as 



E^ 



(M r - M r -) 4 | P r - , A; = M r - + 1 



E< A > 



(P r - P r -) 4 1 P r -, AC = AC- + 1 



n M (Pr- 



The last equality follows because the jump times t n occur with Poisson rate £\(P r 
Squaring the right side of fl3.3|) . 



■ A, 



A. 



(-) EW / drU XA (P r ) < (-)'EW / dr(l + X\P r \) 



A / dr(l + \2^H?) 1 
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where we have applied Part (6) of Proposition 13. II in the first inequality, and the bound |-P r | 2 < 
2H r for the second. By Lemma 12.21 the expectation on the right side is uniformly bounded for 
A < 1. Thus, the above is 0(A) and the left side of (13.31) is 0(Az), and we have proven the 
Lindberg condition. 

□ 

4 Bounds involving the cumulative potential forcing 

Section H~T1 defines the splitting structure that we use, and includes a few elementary formulas 
regarding it. Section 14.41 makes the estimates that convert the rapid oscillations of the force 
^-(X r ) when \P r \ ^> 1 into decay for various integrals of the force. Finally, Section I4T51 bounds 

the momentum drift D± = j Q x dr ^(X r ) for A < 1. 
4.1 Nummelin splitting 

We must start by introducing some extra artificial structure to our process. The split process 
that we define here is a reduced version of that in [21]. In the context of a larger probability 
space, the drift in momentum D t = Jq dr ^(X r ) may be viewed as a martingale plus a few 
small "boundary" terms. This allows us to apply martingale techniques. For those familiar 
with the terminology related to Nummelin splitting, we outline the extension of the process as 
follows: We introduce a resolvent chain embedded in the original process, we split the chain 
using Nummelin's technique, and we extend the resolvent chain to a non-Markovian process 
which contains an embedded version of the original process. 

Let (e m ) be a sequence of mean-1 exponential random variables which are independent of 
each other and of the process (X t , P t ), and let r n = Y^m=i tm with the convention r = 0. The 
T n will be referred to as the partition times. Define N t to be the number of non-zero r n less 
than t, and the Markov chain a n = (X Tn , P Tn ) G S, which is referred to as the resolvent chain. 
The resolvent chain has the same invariant probability density as the original process. Let 
7a be the transition kernel for the chain, which acts on functions from the left and measures 
from the right. Recall that for a Markov chain, an atom is a nonempty set a such that the 
probability transitions starting from a point x G a are independent of x. An atom is said to 
be recurrent if, when starting from the atom, the probability of returning to the atom in the 
future is non-zero. In general, a Markov chain does not necessarily have a recurrent atom. 
The splitting technique we outline presently, originally due to Nummelin, allows us to create a 
recurrent atom from a certain minorization condition. The idea is to extend the state space £ 
to E = S x {0, 1} in order to construct a chain (a n ) G X with a recurrent atom and have the 
statistics for (a n ) embedded in the first component of (a n ). Let v be a probability measure on 
£ and h : £ — > [0, 1) be such that 

T x (s 1 ,ds 2 )>h(s 1 )u(ds 2 ). (4.1) 

By Part (1) of Proposition I4.3[ we may even pick a single pair v and h satisfying (14. ip for all 
A < 1. We have the following transition rates from the state (si,2i) G £ to the infinitesimal 
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region (ds 2 , z 2 ): 



' TE$$(Tx-h®v){s u ds 2 ) z 1 = z 2 = 0, 

J h&L (%-h® v) ( Sl , ds 2 ) Zl = i- Z2 = o, 

(l - h(s 2 ))v(ds 2 ) z\ — 1 — z 2 — 1, 
h(s 2 )v(ds 2 ) Zi — z 2 = 1. 



Given a measure /1 on E, we refer to its splitting fi as the measure on E given by 

fl(ds,z) = x(z = 0)(l - h(s)) fi(ds) + x(z = l)h(s)fi(ds). (4.2) 

In particular, the split chain is taken to have initial distribution given by the splitting of the 
initial distribution for the original (pre-split) chain. The set E x 1 is an atom since the transition 
measure from (si, 1) is independent of s±. Moreover, it is a recurrent atom, since our original 
process is exponentially ergodic to \I/oo,a by Theorem IA.14 and as a consequence, the split chain 
is exponentially ergodic with respect to the invariant state \I/oo,a (see Part (2) of Proposition 14. 3ft 
which has ^/^^(S x 1) = ^oo,x(h) > 0. Notice that the conditional probability that z 2 = 1 
given si, Zi, s 2 is determined by a coin with head-probability h(s 2 ). 

Using the law for the split chain (<j n ) we may construct a split process (St) G S and a 
sequence of times f n with the recipe below. The f n should be thought of as the partition 
times r n embedded in the split statistics, although we temporarily denote them differently to 
emphasize their axiomatic role in the construction of the split process. Let f n and S t = (S t , Z t ) 
be such that 

1. = f , f n < r n+ i, and f n — > 00 almost surely. 

2. The chain (Sf n ) has the same law as (a n ). 

3. For t G [f n , f n+ i), then Z t = Z fn . 

4. Conditioned on the information known up to time f n for S t , t G [0, f n ] and f m , m < n, 
and also the value S? n+l , the law for the trajectories S t , t G [f n , r n+ i] (which includes the 
length f n+ \ — f n ) agrees with the law for the original process conditioned on knowing the 
values Sf n and S? n+1 - 

The marginal distribution for the first component St agrees with the original process and the 
times f n are independent mean-1 exponential random variables which are independent of St- 
Of course, the times f n are not independent of the process S t , and we emphasize that the 
increment f n+ i — f n is not exponential given the state SV n - The process St is not Markovian, 
although, as emphasized in [21], the process (St, Z t , S T a)) G S x {0, 1} x E is Markovian, where 
r(t) is the first partition time f n following time t. We now drop the tilde from f n , and use (a n ) 
to denote the sequence (S Tn ). We refer to the statistics of the split process by and P (A) for 
expectations and probabilities, respectively. 

Now that we have defined the split process St, we can proceed to define the "life cycles". 
Let R' m be the value r„ m for h m = min{n G N | Ylk=oX(Z Tk = 1) = m}. In other words, R' r 



in 



is the mth partition time to visit the atom set E x 1, and we use the convention that R' = 0. 
Define R m , m > 1 to be the partition time following R' m . The mth life cycle is the time interval 
[R m , Rm+i). Intuitively, it may at first seem more natural to define Sr^ as the beginning of the 
life cycle. However, since the distribution for R[ will depend on the initial distribution S . It 
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is more stable to consider the beginning of the life cycle to be the partition time R m following 
R' m , which has distribution v with respect to information known up to time R' m . Although the 
conditional distribution for S^ m is invariant of the value Sr> £ Ex 1, successive live cycles 
[R n -i, Rn), [Rn,R n +i) are obviously not independent, since for instance a.s. lim t AR n S t = S^. 
Let dN t be the counting measure on R + such that f, ti t , dN r = N t2 — N tl for < t\ < t 2 
(i.e. the number of partition times over the interval (ii,^])- The following proposition is a 
consequence of [21] and lists the independence properties that we have. 



Proposition 4.1. 

1. The distribution for Sn n is v when conditioned on all information known up to time R' n : 
Fr' ■ 

2. The sequence of trajectories (St, dN t : t G [R n , R' n+1 ]) are i.i.d. forn > 1, and (S t , dN t : 
t G [Rn, R' n+ i}) is independent of (S t , dN t : t G M + — [R' n , R n +i}) ■ 

3. The trajectory (S t , dN t ■ t G [R n , R n +i}) is independent of (S t , dN t : t G 1R + — 
[R' n , R n +2\) ■ In particular, (S t , dN t : t G [R n , R n+ i\) is independent of [S t , dN t : t G 
[R rn , R m+1 ]) for \n-m\> 2. 

Unfortunately, notation multiplies when the splitting structure is invoked. For easier refer- 
ence, we list the following frequently used symbols: 

St = (St, Z t ) State of the split process at time t. 

r m G M + mth partition time. 

a m = S Tm mth state of the split chain. 

(<7 m , Cm) = 5"m o"m and Cm are the state and binary components, respectively, of a m . 

N t G N Number of partition times up to time t. 

R' m G M + mth partition time visiting the set E x 1. 

R rn G M + Partition time succeeding R' m and the beginning of the mth life cycle. 

N t G N Number of returns to the atom up to time t. 

n m G N Number of partition times in interval (0, Rm]. 

H — > ji The splitting of a measure fi on E as defined in (14. 2 j) . 

Tt Information up to time t for the original process S r and the r m . 

Tt Information up to time t for the split process St and the r m . 

T't Information for St and the r m before time R n +i, where R' n <t < R' n+1 , 

There is some flexibility in the choice of v and h in the criterion ( 14. ip . although choosing 
them invariant of A > adds a little extra constraint. By Part (1) of Proposition I4.3[ we can 
select a pair v, h that is invariant of A, and where both are functions of the energy. We will 
use the symbol v for both the measure and the corresponding density. 

Convention 4.2. We take v and h of the form 

h(s) = u— — ^ ~ - and v(ds) = ds— — ^ ~ - , 

where 1 = 1 + sup x V(x), U > is the normalization constant of v, and u G (0, U) is from Part 
(1) of Proposition \4-3\ 



22 



The compact support of h means that the extended state space for the split dynamics is 
effectively S x 0U{s G E | H(s) < 1} x 1. Any supremum, minimum, etc. over E refers to this 
contracted set. 

In words, Part (3) of the proposition below states the following: given the information that 
time r is a partition time (i.e. N r = N r - + 1) and the information then the probability 
that particle is at the atom at time r (i.e. Z T — 1 ) is h(S r ). This holds regardless of whether 
the state of the process St was already sitting at the atom before time r. For the partition 
times r = r n , n £ N, knowledge of the state S r is contained in T r - almost surely, since a.s. 
lim„yy S v = S r . This limit will hold since the partition times and the collision times in which S r 
jumps will a.s. not coincide. The proof of Part (1) of Proposition 14.31 is contained in Section [61 
and the other parts of the proposition are immediate consequences from the definition of the 
split statistics. 

Proposition 4.3. 

1. There is a constant u > such that the h and v in Convention \4-2\ satisfy T\(s,ds') > 
h(s)u(ds') for all s, s' £ S and A < 1. 

Also, the transition measures T\(s,ds') have densities over the domains {s' £ S | H(s') ^ 
H(s)}, which have the following bound 

Tx(s,ds') ^ 
sup sup < oo. 

A<1 H(s)>l ds 
H(s)^H(s') 

2. The invariant state of both the split chain (a n ) and the split process (St) is the splitting 
of the invariant state of the original process, i.e., 

*oo,aM) = (1 - /i(s))* 00iA (s) and y 00iX (s,l) = h(s)V 00:X (s). 

Thus the "atom" has measure JL ds h(s)^! oo,a( s ) ■ 

3. [Z r = 1 | F r -, N r - N r - = 1] = h(S r ) 

Besides the nearly independent behavior of the process St over the intervals [R m , R m+ i), the 
payoff for introducing the splitting structure includes the closed formulas in Proposition 14.41 
For Part (3) and (4) of the proposition below, £R( A ) is the reduced resolvent of the backward 
generator C* x 

POO 

<R w g= / dre rC *(g), 
Jo 

which operates on g £ L°°(S) with ^oo,x(g) = 0. The reduced resolvent is well-defined since 
the process St is exponentially ergodic by Theorem I A. 11 As A — > 0, the expression in Part (4) 
is related to the diffusion constant k appearing in [3, Thm. 1.1]. 

Proposition 4.4. 

1. For g £ L°°{t), 

Lds¥£(s)g(s) 



m=0 m=l 



J s ds^\s)h(s 



In particular, if g £ L°°(S) does not depend on the binary variable, then the numerator 

[ t d3*&\8)g{s) = f s . 



on the right sides above is equal to J^ds^/^ls) g(s) = L, ds ^> oo,x{s) g{s) . 
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2. For g G L°°(E) 



E 



(A) 



Ri 



drg(S r 



/ E ds ^oo,a(s) g(s) 



J^ds^oo^is) h(s)' 



3. For g G L°°(E) with #oo,a(o) = and s h s 2 G E, 



drg(S r ) 



E 



(A) 



Hi 



drg(S r ) = {^g)( Sl )-{^g)(s 2 ) 



where 5 S is the splitting of the 5 measure at s G E. 
^ For o G L°°(E) with *oo,a(o) = 0, 



E 



(A) 



R2 



drg(S r ) / dr'g(S r >) 



fx dsW 00^(3) h(s) 



Proof. 

Part (1): This follows as a general fact for split chains [28] when the original chain a n is Harris 
recurrent with normalizable invariant measure ^o^a- The distribution for a m for m = and 
m = hi + 1 is v, so counting over [0, hi] or [1, tt-i + 1] has the same result. 



Part (2): Let g { n ] : S 2 R and gW : E R be defined 



as 



9 { n\s,3') 



E (A) 

Ei A > 



drg(S, 
drg(S r ) 



3' = S T 



Also define : E — > R analogously to g^(s) with E^ replaced by E 
We have the equalities 



(A) 

s 



E 



(A) 



ni 



drg(S r )\ =E^[J2& X) 

n=0 



Tn + 1 



drg(S r ) 
f^ds^oc 



°~ni °~n+l 



E 



[E- 

n=0 



n.=0 



(4.3) 



where the second equality holds, since the statistics for S r over an interval (r n , T n +i) given the 
values a n = (a n , ( n ), a n+ i = (cr n +i, Cn+i) is independent of £ n , Cn+i and is the same for the split 
and the original dynamics. The third equality is from Part (1). 
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Starting from the right side of (14.31) . 



/srfstt 0O , A (s)gW(5) 
/ E ds^ 00 , A (s)/i(s) 



is 



J E ds^ 00iA (s)/i(s) / s ds* (50) A(s)/i(s) 
/ E ds * oo , A ( S )Ei A) [/ oo rfre-^(S' r . 



^(is^oo^s)/^) 



/ °° tfr ( Jg rfs froo,A(s)Ej A} [g(gj] ) 

J s cis* 0Oi A(s)/i(s) 
J oo (ire- r (/ E c/s* OOiA (s)5f(s)) _ f^ds^^xis) g(s) 



J E ds !X (s)h(s) 



/ 2 ds *oo,a(s) /i(s) 



(4.4) 



The first equality uses that ^oo,x has the split form in Part (2) of Proposition I4.3[ and the third 
uses that T\ is a mean-1 exponential independent of St in the original statistics. The fourth 
equality is Fubini, and the fifth is due to the stationarity of \I/oo,a- 



Part (3): The reduced resolvent is a pointwise limit 

dr e~ ri g(S r ) 



(9\ {x) g)(s) = limE^ 



lim M A) 

7\0 5 s 



dr e 7-7 (7 (SV) 



Jo 



where the second equality is embedding the expectation in the split statistics. However, for 
Si, S2 G S, 



Ei A) - E (A) 



dr e r " y g(S r 



E (A) - M A) 



Ri 



dr e n g(S r 



since the distribution for the state is v regardless of the initial measure. Using the above 
equalities 



(m {x) g)( Sl ) - (m w g)(s 2 ) = lim fe A) - E (A) 



7\0 V 5 n 
,<-Ri 

E 



Ri 



8 B1 



drg(S r 







dr e rj g(S r 

»Ri 



E 



(A) 
S S1 



drg(S r 



where the limits are well-defined since the process St is positive-recurrent by IA.1I and hence 
E^^] is finite for all s G S. 



Part (4): Notice that 



E 



(A) 



Ri 



drg(S r 



'-jo 



R 2 



dr'g(S r 



M A) few 



n =0 J T " 



R 2 



drg(S r ) / dr' g(S r > 



E f(A) K)L 



n=0 



(4.5) 
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where f W : £ — > M. is defined as 

fW(s): 



E 



(A) 







fl 2 



drg(S r ) / dr' g(Sr') 



The equality ( 14. 5ft uses the strong Markov property at the times r n for n G [0, hi], and the split 
5-distribution 5 S is a consequence of Part (3) of Proposition 14.31 The function f( A )(s) can be 
rewritten as 



fW( s ) = M A) 



drg(S r ) dvg(S v )+( dr g(S r ) E (A) / dr g(S. 



R2 



E 



(A) 



drg(S r ) dvg(S v )+( dr g(S r ))(^g){S T1 ) + c dr g(S r ) 



EW[ r drg(S r )(rr\ {x) g)(S r )+c H dr g(S,, 



(4.6) 



where c G K is the constant such that (9\( x ^g)(s) + c = E-^ J^ 2 dr g(S r )\ for all s G X by Part 

(3). The value for c depends on g and the choice of v, h defining the Nummeling splitting. For 
the second equality, we have used that 



R 2 



drg(S T 



6s n 



drg(S r ) = (X W g)(S Tl )+c, 



where the first equality is by Part (3) of Proposition 14.31 The third equality of (14. 6 j) follows by 



replacing with Es A; , and then using a nested conditional expectation with respect to T r 
for r < T\. 



E^ 



dvg(S v ) + (^g)(S T1 ) 



E 



(A) 



dvg(S v ) + (m^g)(S T1 )] = (mWg)(S r ). 



We can then plug our expression ( 14. 6 p for fW(s) into (14. 5ft and invert the first two steps of 
the proof to obtain that 



Ef A) 



Hi 



R 2 



drg(S r ) / dr' g(S r 



E 



(A) 



Hi 



drg{S r )(pt x >g)(S r )+c / dr g{S, 



Ri 



Finally, we can apply Part (2). The constant c disappears from the expression since \l/ cx>,\{g) 
0. 



□ 



Lemma 4.5. Let the process M t be defined as 

n=l J "» 

T7ie process M t is a martingale with respect to the filtration J : ' t . Moreover, the predictable 
quadratic variation {M) t has the form 

Nt 

71=1 
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where v\ : £ — > 
v x {s) 



2E ( f A) 



o 



zs defined as 



^')((«<^) (S ' 



In t/ie above, 5 S is the splitting of the 8 -distribution at s. 



Proof. The jump times R' n for the martingale M t are predictable with respect to the filtration 
J-" t , although we will show that the values of the jumps still have mean zero with respect to the 
information known before the time of the jump. We can rewrite the martingale as 



M, 



N t 

£ 

n=l 



Rn-i 



Rn 



ax 



d Ax r 

dx 



Rn 



Rn-t 



Rn-t 



d Ax r 

dx 



since by Part (3) of Proposition 14.31 



£(A) 



r? dx 



Rn 



E 



(A) 



5s 



R « J Rn 



dr— (X r )] = (^^)(S Rn ) + c, (4.7) 



dx 



where the second equality is for some c e K. depending on g, u, h by Part (3) of Proposition 14.111 
The jumps of M t have mean zero since for t < R' n , the conditional expectation of the nth 
jump given T' t is 



Rn-\ 



Rn 



dx 



E^ 



Rn^ 



Rn 

R n+2 d y 

dr lx~^ X ' 

p LLJb 



Rn 



R 



n+l 



P 



E 



(A) 



dx 



0. 



For the first equality the first two terms cancel by a nested conditional expectation, since 
T' t C Fr-- For the third term in the first equality, we use the strong Markov property at 
the time R n +\ and that Sn n+1 has distribution v when conditioned on (and thus when 
conditioned on ^F' t ) by Part (1) of Proposition 14.11 The second equality is by Part (2) of 
Proposition 14.41 with g(x,p) = since ^oo,\{^nr) = 0. Thus M t is a martingale. 

For t G [R' n -i, R'n), the a-algebra T[ contains all information before time R n (i.e. T[ = JF R - 

). The predictable quadratic variation (M) t must have the form of a discrete sum over Yln=v 
since the jump times R' n are predictable. As a consequence of Part (3) of Proposition 14.31 
the conditional distribution for Sn n given T' t is o~s Rn - The variance of the nth jump for M t 
conditioned on T' t is vxi^S^), where 



E 



(A) 



\r^(X r 







2E 



(A) 



Ri 



dx 

dV , 
dr—(X r 
dx 



E 



R> 



(A) 



\r^(X r 



dr' A ^(X r 
dx 



dx 
+ 



+ E 



(A) 



- E 



Ri 



R \ dV (Y 

Rl ax 

Rl dV 



2-i 







dr — (X r 

dx 



L </0 



d Ax r 

dx 
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This expression for V\(s) can be written in terms of ^) as in the statement of this lemma 



by using that (D\^ %)(s) = 
Part (4) of Proposition 0~4l 



E 



(A) 



— c and the same reasoning in the proof of 



□ 



4.2 Behavior in the A <C 1 regime 

In the section, we are interested in a few estimates regarding "how long" the life cycles typically 
are for small A and in the asymptotics transition rates 7a- We start off with a little lemma 
regarding the length up to the first partition time T\ depending on the initial state s6S. 

Lemma 4.6. For c = ^ V jjz^, the following inequality holds: 

supsupE- [<5 t (ri)] < ce _t , 

where E~ A ^ [<5t(Ti)] refers to the density of the random variable T\ at the value t > 0. As a 
consequence, 

sup sup E£ A) [(Ri - R[) n ] < oo. 
a<i set 

Proof. In the original dynamics, T\ has a mean-1 exponential distribution regardless of the 
initial state. Splitting the distribution starting from s G S yields the equality 

e"* = E?) [tf t ( Tl )] = (1 - h(s)p£ 0) [5 t (n)} +h(s)ti$ 1) [6 t (T 1 )]. 

For s with H(s) > I, we have h(s) = and no splitting occurs, and thus c = 1 is sufficient for 
the inequality. For s with H(s) < /, then c = inf#( s )<; V = ^ V jjz^, where I, u, and 

U are defined as in Convention 14.21 The bound for the moments of Ri — R[ follows because R± 
is the first partition time after R[, and by the strong Markov property for the chain a n = S Tn , 
since R[ = for the hitting time hi 

□ 

For any fixed A, the original dynamics is exponentially ergodic to the equilibrium state \&oo,a- 
Thus the split dynamics converges exponentially to the \I/oo,a, and the time span R\ and the 
number of partition times hi during a single life cycle will have finite expectation. However, the 
process St = {X t , Pt) behaves more and more like a random walk in the Pt variable as A — > 0, 
so we should expect that 

E [ - X) [Ri] — ► oo and E^ A) [hi] — ► oo, 

as A — > 0, since the return times of a random walk to the region around the origin have infinite 
expectation. However, random walks do have finite fractional moments < | for their return 
times, and Part (2) of Proposition 14.71 states the analogous property for our process. 

Proposition 4.7. Let hi and Ri be defined as above. 

1. There is a C > such that for A < 1 

4 A) N < CX~^ and E { - X) [Ri] < CA~i 
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2. Each fractional moment < a < \ is uniformly bounded for A < 1 

supE£ A) [n?] < oo, and sup E^ A) [R°] < oo. 

A<1 A<1 

Proof. 

Part (1): By Part (1) of Proposition 14.41 for the constant function g(s) = 1 



E { - X) [ni + 1] = f - _ x 7~~\~i~7~~\ = A"^ r v 7'w x 



(27T) 



0(1), 



where the order equality is for small A. The inequality follows similarly by Part (2) of Propo- 
sition SI] for e£ a) 

Part (2): We can prove the result through the Laplace transform by showing that there is a 
C > such that 

sup |E^ A) [e" 7 " 1 ] - l| < C73 and sup IeJ, A) [e" 7i?1 ] - l| < C 7 i 

A<1 A<1 

The proof for R\ and h\ are similar, and we focus on R\. Also, it is sufficient to prove the 
result with R[ rather than Ri, since, by Lemma [4.61 Ra — R[ has finite expectation. We will 
study the following regimes for 7: 

(i) . 7 < A, 

(ii) . A < 7 and 7 sufficiently small. 

The case (i) can be shown with a simple linearization around 7 = 0. As a result of Part 
(1), there exists a C > such that 

p(A) j e _ 7Rl j _ ^ < 7 ^(A) j^j < C" 7A -i 

When 7 < A, the bound on the right side is smaller than C72. 

For the regime (ii) we can no longer rely on the first derivative of the Laplace transform, 
since the upper bound is growing as 0(A~2) . In the analysis below, we will work toward 
showing that there is a c > such that for all < t < A" 1 and A < 1 

|EW [e-' R 'A - 1| < — — ^ . (4.8) 

' L J ' cEl A) [/ 7 drA + x (S r )] -c- 1 

By Lemma 12.41 there is d > such that all 7 > and s in a compact set 

E?)[ I" drAt(S r )]>d^ 
Jo 



2 - d~\ 



With the result of (i), this is sufficient to prove the result. 

We give some preliminary bounds which will be useful. The difference between E-^ [e -7 ^' 1 ] 
and 1 is smaller than 



|lfr(A) r — 7_R' 1 1 I 00 _i 00 

-i| < g ifej^ s (E £ ?V lK; r)~ <4 A, [E^ 



m=l m=l 



" 1 . (4.9) 
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The third inequality follows, since 



-7^ 



where the R' n — R n _i are independent by Part (2) of Proposition |4~T1 and distributed as R'± when 
the initial distribution of the split process is z>, and the inequality is from Yln=i R'n~Rn-i < R' m - 
By ( 14.91) . it is sufficient to give a lower bound for E~ [ J^m=i e_TRm ] • This term can be rewritten 

as 



41 E 



m=l 



E 



(A) 



m=l 



r 



m=l 



where the first equality is from the definition of the times R' m = r^ m , and the second equality 
is by Part (3) of Proposition 14.31 The right side above is equal to 



m=l 



»i = l 



(4.10) 



The first equality uses that the argument of the expectation is a function of the pre-split 
process, and that v is the splitting on v. The last inequality is an equality for the first term, 
since the terms e _7Tm h(a m ) = e _7Tm h(S Tm ) in the sum occur with Poisson rate one, and thus 
Emli e~~ (Tm h(Sr m ) - /„ e" 7r /i(£V) is a martingale. 
So far, we have shown that 



\E^[e-^]-l\ < 



(4.11; 



Now, we find a lower bound E~ A ^ J 7 dr /i(<SV) 
h replaced by At. Define the constant 



J 7 dr h(S r 
in terms of the same expression except with 



"A 



J^ds ^(5)^(3)' 
By the triangle inequality and going to the split statistics, 



< 



(A) 



E 



4 A) [N^ - 1] 



dr h(S, 

E 



E,W 



w -1-1 



n=l 



Rn- 



dryi(S r ) — u\Al(S r 



E 



(A) 



R 



n+l 



f^ds^ oo,x(s) h(s) 



dr(h(S r ) + u\A x (S r 

J R„ 

J J1 ds^ 00>x (s)(h(s) + u x At(s) 
J s ds^ 00iA (s)/i(s) 



where N t is the number of life cycles to be completed by time t. The inequality covers the 
leftover interval [R n , ■y~ 1 } of the integration. For the equality, we have used the Markov property 
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at the times R n (since the embedded chain a n = Sn n is a Markov chain) and Part (2) of 
Proposition 14.41 Moreover, 

Ldse~ XH ^h(s) Ldsh(s) / \ \ u 

where u = f^dsh(s), and the inequality is for A small enough. The third equality is by Part 
(5) of Proposition 14.151 These observation imply that 



- 2 " 



dr^+(S r ) 



L ./o 



2. 



Plugging this inequality into (14. lip gives (14. 8p . 



□ 



The following lemma shows that the transition rates given by 7a and 7a are close to those 
for 7o and 7o for small A provided that the momenta for the starting point is small enough and 
the size of the jump in momentum is not too large. The second part of Lemma [4.81 could have 

3 

a bound of order A a with more work. The constants u, U in the statement of Part (1) are from 
Convention 14.21 

Lemma 4.8. Let \p\ < A~3 and \p — p'\ < A~s. 

1. There exists a,c> such that for small enough A 

\T\(x,p; dx', dp') — %{x,p; dx', dp')\ < c\±%{x,p; dx', dp'). 
Consequently, for b = 



\T\(x,p, z; dx', dp', z) — %(x,p, z; dx', dp', z')\ < cb\ 4 %(x,p, z; dx', dp', z'). 

2. For F G define g ( v X \s, s') = E { s x) [( £ dr F(X r )) v \ S T = s'] . For each v > 1, th 

exists c v > such that for all A < 1 



ere 



dx' dp' %(x , p; dx', dp' 



g { v X) (x,p; x',p') - gf\x,p; x',p') 



< c„A2 . 



Proof. 

Part (1): The inequality for the difference between T\(s, z; s' , z') and To(s, z; s' , z') follows 
from the inequality for the difference of 7a and 7o by an analysis of the cases z G {0, 1}. The 
difference is identically zero when z — 1. For z — 0, 

\T x (s,0;ds',z') -%(s,0;ds',z')\ = h(s' ', z')\T x (s, ds') - T x (s, ds')\ 

<h(s',z')c\^%(s,ds') < (1 - /i(s))" 1 cA37l)(s,0;ds / ,0), 



where h(s, 1) = 1 — h(s,0) = h(s). Moreover, (l — h(s)) 1 < — - = b by our choice of h i 



in 



Convention 14.21 The second inequality above follows by discarding the second term on right 
side of the equality 

(l-h(s'))%(s,ds') = (1 -h(s))fo(s,Q;ds',Q)+ (l - h(s'))h(s)du(s). 
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Now we focus on the difference of 7a and 7o- Let = R™ x S n (t) and T t = U^L o rf^ for n- 
simplicies S n (t) = {(ti, . . . , t n ) G [0, t] n \ t m < t m+1 }. Given an element £ = (pi, *i; . . . ; p n , t n ) G 

let (^ r (^),p r (^)) be the trajectory which begins from (x,p) and evolves deterministically 
according to the Hamiltonian H over the intervals [t m ,t m+ i) (with the convention t = and 
t n+ i = t), and with pt m (£) = Pm for m G [1, n]. In other words, there are collisions at the times 
t m at which the momentum of the particle jumps to p m . The kernel 7a can be written as 

T x (x,p;dx',dp')=E\ [ dC5(x' -x T (0)S(p' -p T (C))Di x \0 
l Jr T 

where the expectation is with respect to the exponential random variable r, and the density 
Z?£^(£) has the form 



p(A) (e) 



Jx(PtT>Pti) e ' 



■/^«6-£ A (Pr(0) 



Let r T A be the set of all trajectories such that |p r | < 2A * for all r G [0, r]. By the triangle 
inequality, 

\T\(x,p; dx\dp') — To(x,p;dx',dp')\ 



< E 



L Jr T 



2supE 



A<1 <-Jr 



d£5(x' - x T (0)S(p'-Pr(0) \Di X \C) ~ D®(t)\ 



dC5(x'-x T (0)S(p'-Pr(0)Di x \0 



(4.12) 



For the second term in ( I4.12p . 



sup E 

A<1 



d£5(x' - x T {i))5(p' - p T (0) D[ x \0 = supEW [T x (x q ,p,; dx', dp' 



A<1 



< I sup sup 

^ A<1 \ p \2>i 

= 0{e- sx ~ k )dx'dp\ 



T\(x,p; dx',dp')y 



dx'dp' 



sup P 

A<1 



sup \P r \ > 2 A 4 

L 0<r<r 



dx'dp' 

(4.13) 



where ^ is the hitting time that \P r \ reaches above 2A", the constant 5 > is picked below, 
/ > is from Convention 14.2} and O(-) refers to the limit A — > 0. The first equality holds by 
the strong Markov property starting from q and by the definition of the transition kernel 7a- 
The inequality is for A small enough so that £ < 4A~2 ; and the first term on the second line is 
finite by Part (1) of Proposition 14.31 The order equality is argued below and follows since r is 
exponentially distributed, and the momentum jumps have uniformly bounded Gaussian tails 
from points \p\ < A~4 and occur with Poisson rate < <?a(A~ 3 ) ~ |. The right term on the 
middle line of ( I4.13P is bounded by 



p(A) 

0,p) 



sup \P r \ > 2A 4 

0<r<T 



< P 



(A) 
(z,p) 



E l^-^l>A-i 



n=l 



<F[k t > < e~ sx ~ k E[e 5 ^}. 



(4.14) 
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where n t is an increasing Levy process starting at zero and with jump rates 



Tj 12 12 

j(u) = sup sup Jx(p\p") < 7: ve ~* v < ce~v v . 

- |p |<2A ^ 
„=|p"-p'| 

The bound on j(t> ) is for some c > 0. The first inequality in (I4.14p holds since \p\ < A~^ and 
so the total variation in the process must reach A~4 for \P r \ to reach above 2\~i. The second 
inequality in (I4.14p holds since the rate of jumps \P r — P r -\ = v occurs with rate smaller }(v) 
at all r < q A r, and the third inequality is Jensen's. The exponential moment on the right side 
of ( I4.14p is finite for small enough 5, since 



E[e SKr ] = E 



-f m +dvj(v) (e*»-l) 



dvj(v) (e 



5v 



We have given an upper bound of 0(e 



for (I4.13p . However, there is a d > such 



that 



d%(x,p;dx',dp') > A^e^ A ^dx'dp' > e~ a *dx'dp', 



(4.15) 



where both inequalities are for A small enough. The first inequality follows by an analysis of 
the contribution to To from the event that two collisions occurred over the time interval [0, r] 
as in the proof of Part (1) of Proposition 14.31 This gives us the bound for the second term on 
the right side of (I4.12p . 

Now we bound the first term on the right side of (I4.12p . For £ £ and m £ [0, n], define 

n t ■ 



j=n—m+l 
n—m 



/JjdrfibCPrtt)) 



■ HMi> f (o.i>, (ay 

By convention, we also set D^n+iiO = D^ x \^) and -D| A q(£) = D^\^). By a telescoping sum 
for each n and the triangle inequality 



E 



oo n „ 

E E 4 / M d t 5 ( x> - XriOW-PriO^l+liO - 



< 

n=0 m=0 

oo 

n=0 



E 



(n) 



dfi(i'-i T ({))«( ! i'-p 1 .({))B<. »({) 



5(x' -X T )5(p' -P T )M T e 



(4.16) 



where we will explain the second inequality below. The third inequality follows be extending 
the integration over the domains T^l to the larger domains T^. 
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Note that when \p\, \p r \ < 2X 4 for r E [0,t] and \p — p'\ < A 9, 

I J\(p,p')e-ti drEx{Pr) - J (p,p')e~ti dr£oM \ 

< \ Jx{p,p') - J {p,p')\e-ti dr£ ^ + J (p,p')\e-ti dr£ ^ 

< (const) \p - p'\ + A3 ^jJo(p,p')e~^ < CX^J (p,p')e~^ , 

for some constant C > 0. Thus, for £ G and m G [0, n], 

|^ + i(0-^S(e)|<cA^w(o. 

Thus, by an inductive argument, 

14^(0 "42.(01 < C^e"™* £>?>(£) < CXie^DW®. 
Continuing with the right side of ( I4.16p . 

~6(x'-X T )5(p'-P T )Ke cx "^ 
8(x' - X T )5(p - P T ) 



fo dr £o{Pr)\ 



(4.17) 



a!e! 0) , 

(a:,?) 



< ^E^ p) 



+ a^; o) . 



5(x' - X T )5(p' - P T )Ke cxl ^ x (\\U T > 1) 



< 



2X^%(x,p; dx', dp') + A^ ( sup j(v))M^ >p) \U T e cxl ^ X {^K > l) 



dx 'dp' 



^ ^ -L 

< 2X*%(x,p;dx',dp') + A2(const)e" A ^ dx' dp' , 

where j(p' — p) = Jo{p,p') ■ For the first term, the second inequality is for A small enough such 

1 

that e cxis < 2 and by the definition of 7o- The inequalities for the second term follow by 



K X >P) 



5{x' - X T )5(p' - P T )M T e cxh ^ X {^K > 1) 



< E 



(o) 
(z,p) 



E[5(x'-X T )5(p'-P T ) \K, T t - ]Ke cxh ^ X (^K > l) 



< (const) E N T e cx ^ NT x{^K > l) dx' dp' = (const) ^ — — < (const)e" A ^ dx' dp' 



where the last inequality is for A small enough, and the equality holds since r is mean-1 
exponential and Aft is a Poisson process with rate |. Again by the remark following (14. 15ft . this 
term is smaller than a constant multiple of T\(x,p; dx', dp'). This completes the proof. 

Part (2): We can rewrite the expression 



/ %(s,ds') gi x \s; s')-g£\s; s') 
Jt, 

\j T drF(S r )) v S(S T -s') 



%(s,ds') 



E 



(o) 



T\(s,ds') 



{J T drF(S r )) v 5(S T -s') 
%(s,ds') 
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By adding and subtracting inserting the term 



E 



(A) 



(Io T dr d I(X r )Y5(S T -s') 



%(s,ds') 

in the absolute value and applying the triangle inequality, we attain the terms on the left sides 
of (EED and ( jUHD below. 

Let T T , r r A , Dl x \ and x T (^),p T (^) be defined as in Part (1). We have the inequalities 



drF(Sr)) S(S T -S') 



drF(Sr)) S(S T -s') 



< E 

< E 



Ei°) 



d£\DW(0-D?\0\ / drF(x r (0,p r (0) 



de|4 A) (0-4 0) (0| / drF(x r (0,p r (0) l + 



<2||F||^E 



r v / de|4 A) (0-^ 0) (e)| 

■/r T , A 



(4.18) 



± 

where the error from replacing T T: \ by T T is of order 0(e~ 5x 3 ) by a similar analysis as in the 
proof of Part (1). The factor of 2 on the last line was introduced to cover the error. Inserting 
a telescoping sum of D^l, and applying the triangle inequality as in the proof of Part (1) gives 



E 



/oo n „ 
da |4 A) (0 - Df\o\] < E E E [ r "7 w « l^+i(e) - 43.(01 
-V,A n =0m=0 F t,a 

oo n „ 

< (const)Af J2J2 e [ tV (n) d U\PtJ0-Pt^ (01 + 1)42.(0 

n=0 m=0 F t,a 
oo n „ 

< (coiiBt)A* jV* s » £ e[t» / d£ (b*J0 - P t - (01 + l)4 0) (0 



n=0 m=0 



const)A4E (0) r 1 " (|P tn - P t - \ + l) = (const)A3 / dp' j(p' 



n=l 



For the second inequality above, we used the second inequality from (I4.17p . The third inequality 
follows from the relation Z}[ A m(£) < e cx ^ m D?\^), and then by extending the domains from 
T^l to ri n) . Thus, we have shown that (jHHD is 0(A*). 
The other term is 



\T\(s,ds') - %(s,ds'] 



E 



(A) 



{J T drF(S r )) v S(S T -s') 



7n(s, ds' 



< II^IISo / \Tx(s,ds')-%(s,ds f ) 



E, 



(A) 



r^(5 r - s') 
%(s,ds') 



(4.19) 
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We split the right side into two more terms based on the events r > A and r < A ^ and 
apply the triangle inequality 



\T\(s,ds') - %(s,ds')\ 



E 



(A) 



r v 5(S T 



%(s, ds' 



< A~* / \Tx{s,ds')-%{s,ds') 



E 



(A) 



5(S T - s') 



T (s,ds') 



cA3 / E( A ) 
Jt, 



t v 5(S t - s') X (r > X~^] 



< / \T\(B,ds')-%(s,ds')\+c\*v\e 
Jt 



-A"^J 



0(X 



where we have used that E 



(A) 



5(S T — s') = To(s,ds') for the first term and Part (1) for the 

send term. The second term on the second line is equal to c\^~E[t v x(i~ > A - ^)] which can be 
computed explicitly, since r is exponential. The integral J E \T\(s,ds') — %(s,ds')\ is O(X^) 
by a similar but simpler analysis than applied above for (I4.18p . We could have modified this 
argument to show that (I4.19P is 0(A 7 ) for any \ < 7 < | by taking the smaller cut-off 



t > A ( i j)v in the analysis above. 



□ 



4.3 Summing a functional over a life cycle 

Let : L°°(£) be defined as 



s e E, 



(4.20) 



where h : £ — > M + is defined as in Convention 14.21 Operators of the form (I4.20p were introduced 
in [27]. In the proof of Lemma I4.1UI below, we will need the following result, which is taken 
from [5]. 

Theorem 4.9. Suppose that g G L°°(S) and g > 0. There is a c> such that for any g and 
\p\ < A- 1 

(U {x) g)(x,p) < c\\ g\\ oo + c\p\ sup g(x',p') + c dp' dx (l + \p'\ A \p\)g(x',p'), 

h'>|a- 2 Jh'<\\- 2 

WU^gU < cA" 1 sup g(x',p') + c sup (^^(xV), 

H'>\\- 2 H'<\\- 2 

where H' = H(x',p'). 

Note that g can be written in the form 

00 

{U^g)(s) = EW (1 - MM •••(!- M^-JM^j] 

n=l 

hi hi hi 

= E^[^g(sJ\+h(s)%^[^g(sJ\>E^ (4.21) 



n=l 



n=0 



n=l 
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where the first equality is from the proof of [T71 Prop. 3.4]. The second equality embeds the 
quantities in the split statistics and uses Part (3) of Proposition 14.31 

The following lemma states that an additive functional of the resolvent chain Y2n9x{ a n), 
where the summation is over a single life cycle and for g\ > with sufficient decay at large 
momentum, has arbitrary finite moments. In other words, not much happens over a single life 
cycle. 

Lemma 4.10. Let g x : £ — > R + and suppose that there is a C > such that 



9x(x,p) < C 



1 



1 + \p\' 



V A. 



for all A < 1 . Then, 



A<1 



n=0 



< OO, 



m e N. 



Proof. For the case m = 1, we have a closed expression f = g\ 

J s ds ^ 00iA (s)/(s) 



E^[j2f(S T 



n=0 



< 



J s ds *oo,a0) h(s) 

i| P |<A-i dxdpf(x,p) + ^erfc(A - 3) sup| p | >A -i f(x,p) 



J s dxdpe- XH ( x 'Pl h(x,p) 



(4.22) 
(4.23) 



where erfc(g) = dpe ~ is the complementary error function, and the equality holds by 
Part (1) of Proposition 14.41 The right side of ( I4.23P is finite by our conditions on g\, and since 
the denominator is approximately J s ds h(s) for small A, and thus bounded away from zero for 
A < 1. 

For m = 2, we write 



ni 

4 A) [($>(s T 



n=0 



n=0 
hi 

n=0 
ni 

< [J] «a(W + 2g x (S Tn )(U^g x )(S. 



2 £ 9x(S T Jg x (S Tn 

hi 

2g x (S Tn )E^\ 9x(S T 



m—n+l 



n=0 



(4.24) 



The a-algebra T T - contains the state S Tn since a.s. lim^ Tn St = S Tn . However, by Part (3) of 
Proposition 14.31 the binary component Z Tn has probabilities h(S Tn ) and 1 — h(S Tn ) for being 
1 or respectively given the information T T - . By these observations and the strong Markov 
property at the time r n we get the first equality below 



m=n+l 



E 



(A) 



5>a(£-J <(U {x) gx)(S Tn ), (4.25) 



m=l 



37 



where 5 S = x( z — 0)(1 — h(s))5 s + x( z = l)h(s)S s is the splitting of the 5-distribution as s G £. 
The inequality is the observation (I4.2ip . 

We can apply 0O3D with / = g\ + 2g x U w g x to bound the right side of (Olj) . Clearly, 
the contribution from ^ is not a problem, since g1(x,p) < Cg\(x,p). For g\U^(g\), there are 
constants such that 



2a(p)(£/ (A) 3a)Go) < (const) 
9x(p){U {X) gx)(p) < (const) 



l + log(l + \p\) 



1 + log(l + 


a- 1 !) 




A 


-2 



for all \p\ < A" 1 , A < 1 
for all |p| > A" 1 , A < 1. 



We have applied Theorem 14.91 along with our conditions on g\ to get 

r\p\ r\p\ v > 

(U w g x )(x,p)<c + c / dp'p'\g x (p')\<c + c' / dp' F < c + c" log(l + \p\), 

Jo Jo 1 + \P I 

for some c,c',c" > and all A < 1 and \p\ < A -1 . Similarly, we can obtain the analogous 
inequality for the domain \p\ > A -1 . 

Now we sketch the proof for the general case m > 2. For e.,- G {<,=}, j < m let the set 
£( ni )(ei, e m _i) be the collection of all (r 1; ...,r m ) G [0,nij m with r x ex r 2 . . . e m _i r m . Also 
define, 

/(ei,...,e m _i) = ^ei " " " 4 ra _i9A, 

where A=, A< are linear operations in which yl = is multiplication by g\ and A< = A = U^ X \ We 
can write 



n=0 

for cr n = S Tn . However 



Lin. comb, over 
(ei ■ ■ -e m _i) of 



E 



(A) 



5^ ^aK-J---^^) 

£( n l)(ei,...,e m _l) 



E 



(A) 



5^ 9aK)-'9aK) <iS> A) [^/( eWm _ 1 )W 

£( n i)(ei,...,£ m _i) 



n=0 



(4.26) 



since we can write the difference between the right and left side of ( I4.26P as a sum of positive 
terms c v _i — c v indexed by v G [1, m — 1], where 



^2 #*(°Vi) ■ ■ ■ gx{(Tr v )f(e v+1 ,...,e m - 1 ){crr v+1 ) 
< n i)(ei,...,e„) 



When e t ,_i is =, then c„_! and c v are identically equal. When e v _\ is <, then the difference 



3s 



c v — c v -i is equal to 



E 



n=r v +l 



"^( n l)(ei,...,€„-i) 

i A) [ &A(0-r 1 )---fl'A(0-r„_i) 



lJ(ei,...,e„_i) 



^ A (o- r jE (A) f(e v+1 ,...,e m - 1 )(^r n ) ^S t - ~ f(e v ,...,e m -i) (°>J 



7i=r„+l 



< E 



(A) 



5^ ^On) ■ ■ •PA(^ r „_ 1 )^A(^rj(^ (A) /( £ „,.., £m _i))(^r„) ~ /(,„,..., e m _x) Or J 

£(°i)(ei,...,e„_i) 



where the second relation is an inequality rather than an equality because ^ A V(&u,...,e m _i)( cr 



over-counts the sum EW 
gously to (14251) . 

Thus we are left to bound E~ A ^ 



En=r„+1 / (£»+i.->fm-i) ' 



En=0 /(ei,...,e m— 1 



)l Cr n 



in the event that £ rtj = 0, analo- 



. The worst case scenario is when all 



the ej are <, since mere multiplication by g\(p) introduces more decay for large \p\. By our 
conditions on g\ and m — 1 applications of Theorem 14.91 

4 (1 + M1 + W)) 



771— 1 



1 + \p\ 



\p\ < A" 1 , 



and we get another bound for \p\ > A 1 which is smaller than a fixed multiple of A 2m 1 for all 
A < 1. Applying the inequality (I4.23P then we attain the bound. 

□ 

The proof of the following proposition is analogous (although simpler) to that of Proposi- 
tion Uj3in the next section, which concerns the case of the oscillatory (non-integrable) function 

Proposition 4.11. Let g : £ — > R satisfy that \g(x,p)\ < i+\ p \2 for some C > and all 

(x,p) e S. 



1. For any m G N, there is a C > such that 



supE^ 



A<1 



sup I / dr g(S r 

0<t<Ri \ Jo 



2m 



< c. 



2. There is a C > such that 



A<1 F ' 



d r g(S r ) <C(l + log(l + H)). 



39 



4.4 Inequalities for the momentum drift 

This section is focused on various inequalities relating to the expectation and higher moments 
of f^dr^(X r ) for certain hitting times oi < One of the main points is Part (1) of 
Proposition 14.141 which states that 



sup4 A) 



A<1 



sup 

0<t<R! 



d Ax r 

dx 



< oo 



for each m > 0. This implies that there will typically not be large fluctuations in the momentum 
drift A = f*dr %{X r ) for t between successive returns R n , R n+ i to the atom (i.e. during a 
single life cycle). 

The first two parts in the lemma below follow from the conservation of energy and the 
quadratic formula and do not depend on the potential being periodic. The third part of 
Lemma 14.121 is a statement about mixing on the torus. If the particle begins with a high 
momentum |P | ^> 1 and is stopped at a random exponential time r, then the distribution on 
the torus T = [0, 1) at the stopping time will be roughly uniform-even in the presence of the 
bounded periodic potential V(x) 

Lemma 4.12. Let (X t , P t ) evolve according to the Hamiltonian H(x,p) = \p 2 + V(x), for a 



positive potential V(x) with sup x V(x) < oo and sup x ^■(a?) < oo. If the initial momentum 
has |P | 2 > 4:V , then the difference P t — Pq = f dr ^(X r ) satisfies 



1. su Ptei?+ | f*dr^(X r )\ < 2sup x V(x)\P \-\ and 



2. 



S>%[x, 



V{X t )-V{X Q ) 
Po 



<2tsup J^(x)|su Pa; ^(x)|Po 



3. Suppose further that V(x) has period one. If t is exponentially distributed with mean r 1 
and F : T — > M is a function of the torus and bounded, then 

E {X(hPo) [F(X T )] - f dxF(x) ^rUFlUlPor' + OdPol" 2 ). 
Jt 



Proof. 



Part (1): Since |Po| 2 > 4sup x ^(x), the momentum P t will not change sign at any time. By 
the conservation of energy 



1 



P + (Pt ~ Po 



-P 2 = -V(X t ) + V(X ) 



2 1 u/i 2 

Using the quadratic formula and that Pt, Po have the same sign, 



IP-Pol = \\P Q \-(P 2 +2V{X Q )-2V{X t )Y\ < 



2V(X )-2V(Xt) 



2J 



dw (Pq+w) 



< 



2sup x V(x) 
|Pnl 



since 



(P 2 + w) ~ 2 < v^lPol" 1 < 2IP,!- 1 for \w\ < \ Pi 
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Part (2): With the identity V(X t ) - V{X ) = f* dr %{X r )P r , then 



' rfr ^ w _L«)-^o) 



o 



dx 



P 



o 



< 



dr 



< t|P | _1 sup \^-(x)\ sup \P r - P | < 2tsup \^-(x)\snpV(x) |P |~ 2 , 



dx 



dx 



where we applied Part (1) for the last inequality. 



Part (3): Let d s : T — > M + be the density of the particle at time r on the torus starting from 
the point s = (X , P ) G £. Then, 

E S [F(X T )] = [ dxd s (x)F(x). (4.27) 
J j 

This leads to the simple bound 

\E S [F(X T )] - [ dxF(x)\ < IIFIUH^-Illi. 

JT 

Thus it is sufficient to show to bound the 1-norm of d s — 1, and, in fact, our bounds can be 
made in the supremum norm. 

Notice that eL can be written as 



d s (a) = J2\Pt ni a)\- 1 re-^ 



(a) 



n=l 



where t = ti(a), t 2 (a), ■ ■ ■ are the periodic sequence of times for which X t mod(l) = a. These 
times will exist for every a G T as long as H(X , P ) > sup x V(x). 

If Asup x V(x) < P 2 , then \P t - P | < 2( sup x V(x)) \Po\~ 1 by Part (1). Thus for large 
momentum |P | 3> (sup x V(x)) 2 , P r is nearly constant and the hitting times t n (a) will be close 
to the sequence of times t = t' n (a) at which Xo + tPo mod(l) = a for a time period at least on 
the order of tz. The period A such that t' n (a) — t^_ 1 (a) = A should thus be close to jp^. When 
|P | is large enough so that \P t — P| < 2 sup^. V(x)|Po| -1 < ||-fb|> then clearly A < jp-r, and 



A- 



< 



W " |Po| 

6 swp x V(x 



drP 



drPn 



<T5l( dr\P r -Po\+r P ° ] dr\P r -P \) 
\Po\ Jo Jo 



< 



(4.28) 



The difference between the first crossing-times \t[(a) — ti(a)\ of the point a can be similarly 
bounded. 



Using the triangle inequality 



\d s (c 



1 < 



dJa) 



r e 



-rf n {a) 



n=l 



00 1 00 



n=l 



r e 



-rt n (a) 



n=l 



1 00 

i I r p -rt n (a) _ 1 I 

'l^ol 2 - 



n=l 



2r 1 



p 







P 







(4.29) 
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where the last inequality follows by further computation using the inequalities above. For 
instance, we can bound the first term on the right as 



IdJa) 



oo 



r e 



n=l 



-rt' n (a)\ 



Define the functions : X — > 



< sup 1 



Po 



-rt',(a) 1 rA < 2SUP X V(X) 



|P |A1 -e" rA 



p2 



□ 



ci x Hs) 







(A) 



T2 dV 
X (Z T1 =0) I dr-^(X r ) 



dx 



E 



(A) 



X (Z T1 =0) / cir— (X r )-C 



dx 



2n 



where ri,r 2 are the first two partition times, n € N, and 5 S = (l — /i(s))<5( s>0 ) + /i(s)5( Si i) (i.e. 
the splitting of the 5-distribution at s). The presence of the factor x(Z T1 = 0) in the above 
definitions is a small technical precaution, and if x{Z T1 — 0) is removed in the formula for 
Cq (s), then we have 



E (A) 



T2 dV 



T2 dV 



E (A) 



Lemma 4.13. For any n > 1, there exists a C > such that for all A < 1, then 

1. |ci A) (x,p)| <C( TT ^VA 2 "), and 

2. |C< A W)|<tf(i4pVA) . 
Proof. 

Part (1): For w = 2m, Cm'(s) is smaller than 



C£>(s)<if 



where the equality is since the initial distribution 5 S is the splitting of 8 S , and the argument of 
the expectations only depends on the pre-split statistics. 

in 

It- 2 d r is^(Xr) f° r s — ( x ,p) i n the three regimes of 



We will bound E 



(A) 



(i) . arbitrary p, 

(ii) . K \p\ < A -1 , and 
(Hi). A- 1 < \p\. 

In the third regime, the absolute value of the momentum is generally drifting downwards and 
there is also a higher frequency of collisions. 
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(i). For arbitrary s E E, we have 

E wf [ T2 dr ^ Xr 

dx 



< sup |^(x)rE[r 2 - Tx] < f ! sup |^(x)r, 



da; 



(4.30) 



since r 2 — T\ is a mean-1 exponential. 

(ii). Next, we consider s = for the regime 1 <C \p\ < A -1 . 



dr^(X r 
dx 



< 



sup 

s'=(x,p) 

ble[|bl,!bl 



E (A) 



SUp 



dx 



x 



ax 



3 5 

\P r \ for some r < Ti 



(AM) 



For the second inequality, we have split the expectation into two terms corresponding to whether 
the event \P r \ E § |p|] for all r E [0, Ti] occurred or not. In the case that the event occurs, 

then we get the first term on the right side in which we have used the Markov property and 
relabeled r 2 and as the first partition time T\ (starting from zero rather than Ti). For the 
complementary event, we applied Cauchy-Schwarz and the inequality (I4.30p . 

To bound the second term on the right, we may apply Chebyshev's inequality, 



p(A) 



\P r \ i [|bl,||^l], for some r < n ] < (A)"lE (A) 



sup 

0<r<pATi 



\Pr~p\ 



sup 

0<r<pAri 



Jr\ 



w\ (— )™sup 

\P\ XGT 



,dV. 
' dx 



[x 



(4.32) 



where w > 1, p is the first jump time \P r \ leaves §M], and J r = P r — p — f^dr ^(X r ) 

is the sum of the momentum jumps up to time r. For the second inequality, we have used 
(x + y) w < 2 w (x w + y w ) and (14.301) to bound the contribution of the potential drift. The 
absolute values of the jumps \Jt n — Jt n -\ have uniformly controlled Gaussian tails and occur 



Pr 



-P 



2<< 



with Poisson rate £\(P r ) < £a(2A _1 ) for r < p, and thus the expectation of sup 0<r<pAn 
is finite. It follows that (I4.32p decays super-polynomially fast for \p\ 3> 1. 

Now we bound the first term on the right side of (14.311) . Let <; be the hitting time that St 
jumps out of the set A p = {(x',p') E £ | ||p'| — \p\\ < ||p|} starting from some s' = (x,p) for 
\p\ e [f |bl]> Define the times t' n = t n A T\ A q, where t n is the time of the nth momentum 
jump. By writing, 







dx 



00 

E 



n=0 " 

we can apply the triangle inequality to get 



"+ 1 dV 

dr—{X r )+ X ^<K 



d Ax r 

dx 



(A) 



<e£> 



E 

n=0 

00 

E 

n=0 



"+ 1 dV 
dr^-(X r 
dx 



e; 



(A) 



tL 



dAx r 

dx 



t dV 



n dAx r 

dx 



(2 V )\ supl^^)! 2 ") 2 "?^^ <n]^, (4.33) 



dx 
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where the second inequality follows by Cauchy-Schwarz and that T\ is a mean-1 exponential. 
The probability < n] decays faster than any polynomial by the same reasoning as 

for (I4.32p . The first term on the right side of f!4.33j) is bounded by 



=?>[(£ 



n=0 



"+ 1 dV 
dr ^-(X r 
dx 



< 



4sup xeT y(x^ 
\P\ 



(4.34) 



where Nt is the number of collisions up to time t. The above inequality uses the definition of 

the t'^s to conclude that for each n, either t' n = t' n+l so that J t , n+1 dr $r-(X r ) = 0, or \Pf n \ > ||p| 

i t' n 

so that we can apply Part (1) of LemmagTJto bound | f t , n+1 dr %(X r )\. The counting process 
Nt has Poisson rate £\{Pt) at time t. Up to time we have that £\{Pt) < £a(2A _1 ) < r for a 
constant r which is guaranteed to be finite by Part (1) of Proposition 13.11 



e w W ]<e[W)T=-^J>»(-^-)' 

n=0 



where N' t is a Poisson process with rate r and r is mean-1, exponentially distributed, and 
independent of N[. The first inequality can be seen by a construction N' T ~ N^+N T for a jump 
process N^ with Poisson jump rate r — S\(P t ) for t < <; and rate r for t > ^ whose jumps are 
decided independently of the jumps for N r . 

(iii). Let $ = T\ A$' where is the hitting time that the absolute value of the momentum \P t \ 
jumps below A -1 . The hitting time is finite, and, in fact, has an expectation that is bounded 
by a multiple of A -1 independently of the initial momentum \p\ > A" 1 . However, the details 
for these points do not matter for this proof. Let (p s be the distribution on T x [— A -1 , A -1 ] for 
P$i) starting from s G E. By the triangle inequality and the strong Markov property 



o 



dx 



v < Ei A > 



dAx r 

dx 



T1 d Ax r 

dx 



(4.35) 



For the first term on the right side on (14.351) . we can write 



d Ax r 

dx 



A dV (Y 



n=l 



<2^(supK(x))X A) E 



dx 

M$ 



n=l 



\P- 



< CX V E[ X) 



m$ 
[IE 



(4.36) 



The first inequality is Part (1) of Lemma [4.121 which is applied for the Hamiltonian evolution 
on each interval [t n _i,t n ). The C for the second inequality is guaranteed by Part (2) of Propo- 



sition 13.11 Notice that ^2 



Mr 



1 Sx(P-) 

Jo ^ s s x \p ) i since the counting process N r has jump rate £\(Pi 
triangle inequality, 



— r is a martingale with predictable quadratic variation 

Since $ < T\ and by the 



1 



M T - 



< E (A) 



i 



n 



< C"E (A) 



ds 



Sx(Ps 

< (C" + l)(l + A)8(u!)« +(«!)« 



E (A) [ T «] „ 

Ei A ) 



l<n<M T1 t-xK^t- 



E (A) |- r «j i 
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where the constant C comes from Rosenthal's inequality (see e.g. [TT], Lemma 2.1), and third 

by Part (1) of Proposition 13.11 and since T\ is exponential 



8(1+A) 



inequality is because £\(p) > 
with mean one. 

For the second term on the right side of (I4.35p . we can apply our results (i) and (ii) above 
to guarantee the existence of a C > such that 



dr^(X r ) 



1 



1 + \p'\ V 

Li i] dp" T ^j x (p', P ") M dpi'^j^p'y) 

^ C ^ t^j^) = 0{XV)+C ^ t^MM ' (437) 

where the third expression should be understood as the supremum over all \p'\ > A -1 for the 
expectation for 1+ ^ p & conditioned on p' = P T - . The final term in ( I4.37P is super-polynomially 
small by Part (7) of Proposition 13.11 and Chebyshev's inequality. 



Part (2): By (I4.30p . we have an upper bound for sup^) |Cq (x,p)\. For p in the regime 
1 <C \p\ < A~ x , define q as in Part (1). Also define t n as the sequence of collision times starting 
after T\ with t = n, and t' n = t n AqAT 2 . The difference between Cq A ^(s) and E^ [ dr ^(X, 
is small when \p\ ^> 1, since 



Ei A > 



T2 dAx r ) 



dx 



-C 



E 



E f 



(A) 



(A), 
1 

T2 



E 



(A) 



dx 



i(A), 



dAx r 

dx 



< Ej 



7"2 



dV 



dA Xr 
dx 



S E (A) 



fr 2 (S 



dV 



Os 



h 2 {S. 



< 2^sup|^(x)| 5 Ei A) [h 2 (S, 



n 



E~ . The first inequality is Cauchy- 

Os 



The first and the third equalities have used that Ej 
Schwarz, and the second inequality uses that r 2 — T\ is a mean-1 exponential. The function 
/i(s) < 1 has compact support, and there is a c > such that E^^ [/i(5 Tl )] < ce~ A 1 V e~' p '. In 
fact, the bound can be given a Gaussian form by the Gaussian tails for the jump rates (jl.2p . 
By the above remarks, we may work with E { S X) [ HI dr 7E( X r)] > 

since the error is 0(\p\ 2 ). 

Now we will show that the difference of this term with the expression ^E^ [V(X T2 ) — V(X Tl )] 
is also 0(\p\~ 2 ). Similarly to Part (1), we can write 



dV 



n 



dV 



dr %-(X r ) = Y, / dr ^-(X r ) + X ( q < r 2 ) / dr ^-(X, 



n=0 « 



da; 



dV 



cVri 



dx 



The difference is bounded by 

1 

dx 



E (A) 



n 



P 



< 



sup x VfoO 

M 



+ sup 



dV 
dx 



F s [? < r 2 ] 



+ 



e1 a) [ E 



dx 



n=0 
n=0 



1 



(4.38) 



15 



where M r , r > t\ is the number of collision times t n in the interval (t\, r], and the first term on 
the right side occurs by bounding the expectations of V ^ Xt2 ^ v ^ x ^ and xis — T i) J^ n dr ^(X r ). 

V(X t , )-V(X t ,J 

The above follows by adding and subtracting terms ri± j 5 — for n G [0, AQ and applying 

the triangle inequality. By the analysis in Part (1), P s < r 2 ] decays super-polynomially in p. 



"+ 1 , dV 



n=0 ™ 



ax 



V(X t ,)-V(X t ,J 



Pf 



<2sup|^(x)|su P y(a;)E( A )[^ « +1 - f n ) |P r J~ 2 ] < A sup | A( x ) | sup 



n=0 



where the second inequality uses that |P r? /J > \\p\, by definition, for n < M q , and also uses 
£ d - *n = ? < t 2 - T! so that E?> [ £ t' n+1 - t' n ] = E< A > [ q ] < E<*> [ Ta - rj = 1. 

n=0 n=0 

For the third term on the right side of ( 14.381) . 



n=0 



< 



supV(z)EW[X; ^ry] <^(^p|^^l)(^P^)) E i A) [E b- p 4l 



n=0 



-^ (su , p| ^ (x)|)( T nx)) ^ A) 



sup P r — p\ 

0<r<? 



f E?>[^]* = 0(|p|- 2 ). 



The first inequality uses the definition of q to conclude that < \Pt' n \ < §|p| for n < M q , and 
the third is Cauchy-Schwarz. Arbitrary moments of M q are finite by (14.34 p . 

For the term ±Ei A) [V(X T2 ) - V(X n )], we just need to show | IE£ A) [V(X T2 ) - V(X Tl )] \ = 
0(|p| _1 ). By the triangle inequality, 



\E^[V(X T2 )-V(X T1 



< 



Ei A) [V{X T2 



dxV(x)] + E[ X) [V(X T1 



dx V(x)\ 



The terms on the right side are similar, so we will study the second. Let us offer a reconstruction 
of the counting process Mt for the number of collisions up to time q as follows. Let N' be a 
Poisson clock with rate r = S\(2X~ 1 ) as in Part (1). The Poisson rate of jumps £\{Pt) for the 
process Mt satisfies £\{Pt) < r for times t < q. At each jump time r n < q for the Poisson 
process N', we then flip an independent coin with weight r~ 1 £\(P r . n ) to determine if a jump 
for Mt (i.e. a collision) occurred at time r n . This recovers the statistics for Mt- We then define 
r' n = r n A T\ for n < N^ A . Conditioned on the past T r i n and the event T\ > r' n , the increment 
r' n is exponentially distributed with mean (1 + r) -1 . When conditioned on the event 



n+l 



Tl 



n+l' 



the increment is exponential with mean 1. 



46 



We can rewrite the expectation Eg [V (X T1 )] in terms of the event T\ = r' n for n > 1 or 



T\ > max n r r 
E< A > [V(X n )] = [V(X T1 ) X (rx > max r' n ) 

oo 

+ E^\Y,x(T 1 =r' n+1 )EW[V(X T1 )\F r , n , n = < l+1 ]j. (4.39) 



n=0 



The first term on the right is smaller than sup x V^(x) times the probability of the event 
max n r^ ^ t\, which can also be phrased as the event that q < t. By the analysis in Part (1), 
F« [<Z < t] is super-polynomially small in \p\. Thus ^^L ^ A ^ Tl = r n+i] * s super-polynomially 
close to 1. Since r' n+1 — r' n is exponentially distributed, by Part (3) of Lemma 14.121 

E^[V(X Tl )\ Jv n , ri = r^.J - / cfeVfc) < (const) I^T 1 < 2(const)|p|- 1 , 

Ji 

where the second inequality follows since \P r > n \ > \\v\ by the definition of the r' n , which are less 
than q. 

The analysis of the region \p\ > A -1 follows from that of Part (1), and so the proof is 
complete. 

□ 

Proposition 4.14. 

1. For any m G N, there is a C > such that 



sup E { ~ x) 



A<1 



sup 

0<t<Ri y Jo 



2m 



< c. 



2. There is a C > such that 

,(A) 



sup E 

A<1 



(x,p,z) 



Rl dV 1 

dr-(X r )\ <C(l + log(l + b|)). 



3. As A ^ 0, 



E 



(A) 



Rl alV 
dr^-(X r 
ax 



R2 dV 
dr'^-(X r/ 



dx 



E 



(o) 



dx y ■ 



R2 dV 1 

dr'^-(X r ,) +0(A^ 
dx J 



Proof. 

Part (1): For < t < R u 







* dV 







T1 dV 
dr— pT r )-x(C N< =0) 



TN *+ 1 rfV^ 



- x (C Nt+ i = 0) / rfr — (X r ) + ^ C< A) (a») + m 1 + < (4.40) 

jT N t + i flX n=0 
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where m t and m' t correspond to odd and even contributions of the form 



L|N t -ij+l 



m, 



m, 



E (x(C2n = 0) / dr -j-{X r ) - Cf'K, 

n=l ^ T2 ™ 

E (*(C**+i = °) / dr T {Xr) ~ c ° 

n=0 ^T- 2n+ i 



The processes m t , m' t are not adapted to Tt-, since, for instance, m' t makes the jump 

x(C 2 n+i = 0) / dr—(X r )-C { X \a 2n ) 

at time r 2n , and the size of the jump depends on X t up to time r 2n+ 2- Let T" be the u-algebra 
of all information before time r n+ 2, where r n < t < r n+1 . The process m' t is a martingale with 
respect to T". To see this let us consider a time t < r 2n _i, then the following equalities hold: 



E,~ 



X(C2n+l = 0) 



T2n+1 UX 



E f 
E £ 

E,: 



E £ 



X(C: 



f T2n + 2 dV 

2n +l = 0) / rfr -^-(-Xr 

x(C 2 n + i = 0)/ dr— (X 

^r 2n+ i ax 



T" 



(4.41) 



The nested conditional expectation on the first line uses that T" C T T - , and the third equality 

is by definition of Cq A \ The second equality uses that ( 2 n = 1 or ^ 2n — with probabilities 
respectively of h(<T2n) and 1 — h(a2 n ) given T T - by Part 3 of Proposition 14. 3[ the strong 

Markov property starting from the time T2 n , and the definition of the distribution 5 S (which is 
the splitting of the (^-distribution at s G S). The predictable quadratic variation (m[) for the 
martingale m' t has the form 



L§N t j 



(4.42) 



n=0 



The analogous statements hold for m t . 
By the triangle inequality, 



E 



(A) 



sup 

0<t<R! 



+ E 



(A) 



* dV 2m 

sup (Elci A) K) 



i 

2 m 



<6sup|^-(x)|E^ A) [ sup (r Nf+1 -r Nt ) 2m ] 



i 

2)1! 



2m 



i 

2,11 



fix 



' 0<t<i?i 
|2mi 



E^[ sup |m t H^+EW[ sup |m;n-, 

0<t<R L 0<<<i?i 



(4.43) 



where we have bounded each of the first three terms on the right side of (I4.40p by the supremum 
of |^-(x)| multiplied by the longest interval r n+ i — r n over n < h\. We used a factor of 6 rather 
than 3, since there is one term for which the interval [r 2n+ i, r 2n +2] will have r 2n+ i G [i?i,i? 2 ] 
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rather than < R\. We thus double the bound, since we can apply the Markov property for 
stating from time R\. We now look at the terms on the right side one-by-one. 
For the first term on the right side of (14.431) . 



EH sup (r Nt+1 -r Nt ) 2m ] =E^[ sup (r n+1 - r n ) 2m ] 

0<t<i?i 0<n<ni 



< c 2m M A) 



E[ sup e^nj < c' + c% X) [(logfa))""] , (4.44) 



2mi 



0<n<n 



where e n are i.i.d. mean-1 exponential random variables independent of everything else. The 
c > in the first inequality is from Lemma [4.61 and replacing the (r n+1 — r n )'s with the e n 's. 
The d > for the second inequality exists by an elementary analysis of E^ A ^ [ sup 0<n<Ar e 2m ] 
for N > and independent exponential random variables e n with mean one. The value 
E~ [( log(ni)) 2m ] is finite by Proposition 14.71 since the fractional moments E~ [n"] are fi- 
nite for < a < \. 

For the second term on the right side of (14.431) . obviously 



Ef A) 



sup ^|ci A) K) 



2m 



E 



n=0 



ni 

f[(E 



ci J, K) 



2mn 



n=0 



since N t = ni for t G [i? n _i,i? n ). By Lemma f4.13[ g\ = Cq has the inequality g\(x,p) < 
C(j^2 V A) for A < 1. Hence by Lemma f4.10[ the above sum is bounded independently of 
A< 1. 

The last two terms on the right side of ( 14. 43ft are similar, so we will only treat the last. By 
Rosenthal's inequality (see e.g. [11], Lemma 2.1), there is a C" such that 



2m 



sup 

0<n<m 



eH sup | m ; 

0<t<R! 

fti+i 

<C"E^\(j2c[ X \a n ] 



n=0 



C"E^ 



0) 

ftl 

^ C (A) 



n+1 





fT2n+2 


= 0) 






' T2n + 1 




, dV 




dr — — 


' Tn + 1 


dx 



dV (A) 
~dx ~ 



i(A), 



4m 



n=0 



n=0 



For the second inequality, we have used the standard technique to bound the supremum in 
the second term with ( sup n a n ) 2 < Y^ n a 2 and Jensen's inequality, and we have also included 
the odd terms. The first term in the third inequality is bounded with the equality (I4.42p 
and by including the odd terms. For the second term in the third equality is by inserting a 
nested conditional expectation with respect to T - for the nth term in sum and applying 

n—X 

the argument in (]4.4ip . To bound both terms on the right side, we apply Lemma 14.131 and 
Lemma [4. 101 as above. 

Part (2): Again, we start by writing J^ 1 dr ^(X r ) as in ( I4.40p . and use the triangle inequality 
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to get 



e (a) 



+ E ( ; A) 



Rl dV 
ax 



dr^-(X r 
dx 



< E W 
+ E 



Q UjJU 

hi 
n=0 



E 



(A) 



dx 



+ E( A) [|m Rl |^+E^[|m^|^, (4.45) 



(A) 



where we have also applied Jensen's inequality to the last two terms. The first three terms on 
the right side are bounded by csup x 1^(^)1, where c > is from Lemma [4.61 This follows for 
the first term, for instance, since 



E 



(A) 



dr % iXr) W ^ su P|^^|Ei- A) h] <csup|^(x) 



where the second inequality is by Lemma [4.61 

Since m£ is a martingale, we have the first equality below 



A similar calculation holds for the term Eg | ] . With the above, then 



n=l 

I 2! 



n=l 



E< A > 



dx 



,dV 



< 3c sup |— (x)\ +E K 5 

x ClX 



(A) 



[5:ic« i »k)|] + 2e™[5:c« i »k 



n=l 



n=l 



< 3c sup | ~r~{ x ) I + BEW j I £ |C< A) K)| + &*E<£ , I £ C[ x \a n 



x dx 
JV 



hi 



n=l 



ni 



n=l 



= 3c sup \-j-(x)\ + b(W {x) C { x) )(x, P ) + bU (W (A) CS A) ) (x, p) 
x dx v 

for 6 = ^ V g and the map : ->- L°°(S) was defined in Section [OJ The 

second inequality uses that E s = (1 — /i(s))E( s ) + With Part (1) of Lemma [4.131 for 

g x (s) = C[, A) (s) and gx(s) — c[ \s) and Lemma 14,91 we obtain a logarithmic bound in \p\ for 
the right side. 



Part (3): We can split the expectation into two terms as 



E 



(A) 



Ri 



dV 



dx 
2 v 

2 v 



R 2 



,dV 



dr 'lx- [Xr ' 
R 'i dV 

dx 



~(A)T f Rl . dV 



R2 dV 
dr"—{X r ) I dr'—(X, 



dx 



Tl dV 
dx 



dx 



Rl dV 
dr^-(X r ) I dr'^-(X r , 
dx 



where v' is the distribution for G £ x 1 when S is distributed as v and has the density 



v'(s,z) = x{z = 1) 



f^ds^oc^^his) 
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The second equality above is by the strong Markov property for the split resolvent chain 
corresponding to the time R[ = . The two terms on the right are handled similarly, and we 
will treat the first. 

As in Part (2), we can rewrite the argument of the expectation as a function of the first 
component of the chain a n = (a n , ( n ) up to n = h\ rather than the process {St), since 



E 



(A) 



Rl dr^-iXr)) 2 ] =4 A) [/W(a ,...,a, 1 )] = / d^(0/ W (0, 



where T is the set of all trajectories £ = (er , . . . , cr^J starting in the distribution v and ending 
at the first return to the set E X 1, is the measure induced on T, and /( A ) is 



h\ — 1 



ni — 1 



ni — 1 



n=0 



n=0 



/ (A) (<J , . . • , (T fLl ) = ^ 92 fan, fffi+l) + 2 9l\^n, <T n +l) 9l (°m, <?■ 

for gin ■ S 2 -»■ E defined as 



n+l=m 



^Ol;^) 



"■n — s 2 



Note that /( A ) is positive, since it is the conditional expectation of | J^ 1 dr ^(X r )| z given 
the trajectory £. Let T\ be the event that fi\ < X~9 and \P Tn \ < A~3 for n < n\. By the 
triangle inequality, 



#a(£)/ w (£)- / #o(£)/ w (£) 



< 



#a(0/ (a) (0- / #o(£)/ (0) (£) 



+ / #a(0/ (A) (0+/ #o(0/ (0) (0- (4-46) 
The last two terms on the right side are similar, and the first of them is smaller than 



dMOf {X) (0=^> X(T A )( / dr^-{X r 



dx 



<P^ [T c A ] 5 sup4' 

A<1 



(A) 



R 'i dV 



where we have applied the Cauchy-Schwarz inequality By Part (3), the rightmost term is finite. 
We will show that P^ A) [T A ] = O(A^). Since \P r \ < Q r = 2^H?, we have that 



0<m<A~9 



(4.47) 



For the first term on the right side of (I4.47p . Chebyshev's inequality gives 

ft A) [ni>A"s] < A^E~ A ^ [n"] < C a \% , 
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where < a < \ and the C a of the second inequality is from Proposition 14.71 We can choose 



a 



20 ' 



For the second term on the right side of (14.471) . 



(A) 



SUP Q Tm > 



0<m<A~5 



sup Q Tm > A' 



0<m<A~5 



< 



sup Q r > A-4 + 0(A) 

0<r<2A~1? 



sup Q 2 r + 0(X), 

0<r<2\~y 



where T\ = r m for m = |_A~9 J . The first equality holds, since the argument of the probability 
is well-defined in the original dynamics. Since the times r m are mean-m exponential, the 
probability that r m > 2A~s for m = [A" 5 ] is super-polynomially small. There is a C > such 
that for all A < 1, 



sup Q r 

0<r<T 



< CT 



by similar techniques as for in the proof in the Lemma [2.21 Plugging in with T = 2A~5 implies 
that the rightmost term in (I4.47P is 0{X~&) and thus [Y A ] = O(X^o). 
For the first term on the right side of (14.461) . 



#a(0/ (a) (0- 



T; 



< 



# (e)/ (0) (e) 

(#a(£)-#o(£))/ (0) (£) 



+ 



#a(0 |/ (A) (0 - / W (£)|- (4.48) 



(0), 



Let ^ A be the measure over T or Ya in which the first n steps are made with transition 



measure % and the remaining are with respect to 7a- Thus ip 



(0) 



tpx, and i/j 



(N) 



ipo for 



N > [A 9 J in the event Ya. For £ G Ta, then applying Part (1) of Lemma [4.81 

Employing this inductively, then we have the two inequalities 

K" +1) (0 -#A re) (£)| < X(ni > n)bcX*e bc ^- n ^dM0 

< X {ni > n)bcXh bcxl dM0- 



(4.49) 



By adding and subtracting the terms dip^ (£), < n < \X 9 J as a telescoping sum and 
applying the triangle inequality and ( I4.49p . 



LA-sj-i 



#o(0 - #a(0| < E l^o n+1) (0 - #i n) (0| < bcXh bcX UM0- (4-50) 



n=0 



Thus with (I4.50p . the first term on the right side of (I4.48P is bounded by 



(dU0-dM0) / (0) (0 



< / |#a(0-#o(0|/ (0) (0 
<bcxh bcX * J dMOf {0 \0 = bcxh bcxh Ef ) [(J* 1 ■ rfy 



dr— (X r 

dx 



0(Xs), (4.51) 



52 



where the third inequality follows by extending the domain of integration. 
For the second term on the right side of (14.481) . 



dM0\f iX) (0- f i0) (C)\<e 



be AS 



< 



#b(0|/ (A) (0- / (0) (0| 

2e 6cAH / #„(£) E \9rK,(T n+1 )-gi°\a n ,a n+1 )\ 



hi — 1 



n=0 
hi— 1 



+ 4e bcAH / <# (0 E \g[ x) (a n , a n+1 ) - g { i\a n , a n+1 )\ E \9?(^m, o"m+i) 



(0), 



ni —1 



n=0 
ni— 1 



m=n+l 



ni — 1 



+ 4e bcA W # (0 E kiK,^n + i)| E \gf\a m} a m+l )- g {X \a n ,a n+1 ) 

n=0 m=n+l 

The three terms on the right side are handled similarly, so we will address the second. 

„ rii— 1 hi— 1 

/ #o(0 



) ~9l {VniVn+l)\ E |^ 0) (crm ; °"m+l)| 



m=n+l 



^ (o E I 

g[ X \a n , cr nH 

n=0 

„ hi— 1 fii— 1 /JT/ /" /n— x 

</ *(6E^K) E *4 A) K)<A-^snp|— (a) | / 



ni — 1 



n=0 



m=n+l 



n=0 



(4.52) 



<A *6( sup I— — (a) I) sup w[ X \x',p', z') = 0(\*), 
a ax 1 , , l 

|p'|<A"3 

where b = V ^ and iwi, : S — >■ IR + are defined as 

«,W(S)= /f^^^Ol^^^')-^^,^! and w ( 2 X \~s)= fr^(s,ds')\g[°\s,s% 
Jt, Jt, 

For the second inequality of (I4.52p . we have used that fi\ < A~^ for £ G T \ and 



wf } (5) < E s ~ 



dV 



dV 



dr —(X r ) < ( sup I— (x)\ )E 5 [rl <6sup| — (x)|, 



r/.r 



and the last inequality is by Lemma 14.61 Moreover, 
,[ x \§)=E ix) [\g[ x \S ,S T )-g?\S ,S T 



•UN 



<6E( A )[|^ A) (So,S T )-^ 0) (5o,5 r )|] = 6 / rfs'r (0) (s, S ')ki A) ( 



s,s ) -g{ [s,s )\. 



The first inequality follows by the usual relation E^ = (1 — /i(s))e| A qj + /i(s)E^L. By Part 

(2) of Lemma S21 the supremum of the right side all s = (x,p) with \p\ < A~3 is O(A^), and 
thus we get the order equality in (I4.52|) . 

□ 
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4.5 Proof that cumulative forcing vanishes 

Now we nearly have the material required to show the convergence sup 0<t<T |A^/}^ | =>■ 0, and 

more precisely, is O(X^). 

Part of our argument will rely on an analysis of certain processes related to the energy H r . 
Recall that the processes Qt, M t , and A r are such that 

Q t = (2H t ) ^ = Q + Mt + A t . 

The processes M 4 and A t are the martingale and predictable parts in the semimartingale 
decomposition of Q t , and both are initially zero. Let A\,Af,V x ■ T x R -> R be defined 
as in Section |2j The predictable component has the form A t = J Q dr A\(X r , P r ). We also 
define A^ = J Q * dr A^(X r , P r ). The martingale M t has predictable quadratic variation (M) f = 
jldr V\(X r , P r ). The proposition below is an extension of Proposition 12.11 and contains some 
more bounds regarding the functions Af, V\, and K\, n - 

Proposition 4.15. There exists C,c> such that for A small enough, 

1. For\-l < \p\ < 



A- x (x,p)--\\p\ <C\*\p\, V x (x,p)-1 



< CA2, and 



2/C Ai2 (x,p) 



< CAa. 



2. For all (x,p) G X, A x (x,p) < \D\(p)\. In particular, for \p\ < A 1 , one has A x (x,p) < 
CX\p\. 



3. For all (x,p) G S with \p\ > X' 1 , 4^fer > c> 0. 
4- For all (x,p) G S, 



A x (x,p) , 2AJp| 
£ x (p) "T 1+A 



£x(p) 
< C. 



5. As A —7- ; we /iai>e 



ds„4+(s) = 1 + 0(A). 



The following proposition lists two easy facts that will be referred to often. 

Proposition 4.16. For the split statistics, Nt~ X^n=i ^(^v n ) ^ s a martingale with respect to the 
filtration JF t . For the original statistics, Yln=i M*SV n ) — Jo dr h(S r ) is a martingale with respect 
to T t . In particular, 

E (A) [N t ] = E (A) 



dr h(S r 



Proof. The difference 



N, 



n=l 



71=1 



is a martingale by Part (3) of Proposition 14.31 The difference Y^=i ^O^n) — jl dr h(S r ) is a 
martingale according to the pre-split law, since the contributions h(S Tn ) occur with Poisson 
rate 1. 

□ 
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Define L t = u 1 J * dr h(X r , P r ). The following lemma states that rescaled versions of L t 
and Af are close, and gives a bound for the rescaled version of L t . (M) t is the predictable 
quadratic variation of the martingale M from Lemma 14.51 



Lemma 4.17. As X ^ 0, 
1. 



sup 

0<t<T 



X^Lt_ - A^At 



0{X±). 



Moreover, there is a C > such that for X < 1, then 

E {x) [\^Lt] < c. 



2. 



E^ 



sup 

0<t<T 



\»(M)t - \*v x Nt = 0(A3). 



Also, for any t > 0, the expectations are equal E (A) [(Af) J = t; A E (A) [jV t ] . 

Proof. 

Part (1): First, we will show that there is a C > such that for all A < 1 



Define the constant 



ax := E£ 



(A) 



drAl{S r 



'-jo 



E w [X^Nt] < c. 
hdsV^xWAtis) _ f s dxdpe- x & 2+v ^Ai(x,p) 



hdsVooMhis) J^dxdpe-^ 2 



-V(x)) 



where the second equality holds by Part (1) of Proposition ^. Ill By Part (5) of Proposition 14.151 
the compact support for h, and the fact J^dshls), we have that ax = u 1 + 0(A). Th 
expectation of X^Nt is bounded by 



Nt-1 

A 



E (a) [A5]Vt] = X^ + E (x) A^ a t / drA+{S, 

n=l J R ™ 



<A3 + -EW|As r drAi(S r ) =X l i+-E^[X l iAt}. 



(4.53) 



(4.54) 



The first equality uses the Markov property and the definition of a\, and the second equality 
merely goes back to the pre-split statistics. The inequality is for A small enough such that 
a x > |u. 

Since A+ = Qt — Qo — M t + A^, then by the triangle inequality, 
EW[A3A+] <2EW[ sup A3QJ +eW[|A3Mt|] + E^ [| aU z |] . 



o<t<- 



< 2E (A) [ sup A^Q t ] +EW A / drV A (£ 



E< A > 



A^ 



°<*<x 



< 2E (A) [ sup A^Q 4 ] +CfE (A) A / rfr (1 + AQ r ) 5 + C 2 E (A) A^ / drAQ r (4.55) 
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For the second inequality, the second term employs Jensen's inequality with the square function 
along with the fact that the martingale M t has bracket (M)t = J * drV x (S r ). The bound for 
the second term in the third equality is Part (3) of Proposition 12.11 and £\(P r ) < c(l + A|P r |) < 
c(l + AQ r ) for some c > by Part (2) of Proposition 13. II The third term in the third inequality 
is Part (3) of Proposition 14. 15l and \P r \ < Q r . The right side of ( 14.551) is uniformly bounded in 

A, since E^ [Q r ] < E^ [2H r ] * by Jensen's, and sup < r < t E< A > [H r ] < CX' 1 for some C > by 
Lemma 12.21 

It follows from the above that E^ A ^ [A^TVt] is bounded uniformly for small A. Since 

uE^[N t ] =m[Lt] 

by Part (1) of Proposition 14. 16( we also have that E^ A ^ [A^Lt] is uniformly bounded. 
To show that X^L± is close to A^At, we will consider the split dynamics. 



sup 

0<t<T 



E< A > 
< A^E (A) 



X^Lt - x*a: 



dr + sup 



E< A) 



sup 

0<t<T 



A^Li - A2A+ 



0<i<y J t 



R 



N t + 1 



dr) g x (S r ) 



sup 

0<t< 



X 

A n=l 



Rn 



g' x (S r ) , (4.56) 



where R n is the starting time of the n life cycle, N t is the number of R' n (not necessarily R n ) 
which have occurred up to time t, g x = A x + u _1 /i, and g' x = A x — u^h. 

The first term on the right side contains the boundary terms for the partition of the integrals 
over the interval [0, ?] using the life cycle times R n . The expectation E^ A ^ — R[] is smaller 
than c > uniformly in A < 1 by Lemma 14.61 so 



Ri 



A^fW / drg x (S r ) < A*fW / dr g x (S r ) + cA^ ( sup g x (s)) . 



s€S 



By Part (4) of Proposition 12. II and since h has compact support, g = A x + u 1 h satisfies the 
conditions of Proposition 14.111 By Part (3) of Proposition 14.111 we have 



drg x (S r ) =X* ldfi(x,p,z)E^ pz) I dr g x (S r ) 



R'i 



<CX^ [dji{x,p,z)(l + log(l + \p\))=C\$ [ dfi(x,p)(l + \og(l + \p\)) = 0(X*), 

for some C > and all A < 1. The integral above is finite by our assumptions on the initial 
measure fi. 

The other part of the first term on the right side of (I4.56P is bounded by 



f R 

sup / 

0<n<N T J 



n+l 



X 



Rn+1 

drg x (S r ) 



drg x (S r )\ <Xm^[J2 

n=l 

< C3A^E (A) [Nt^ = O(Ai). 
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The first inequality uses that sup n a n < (J2 n a n)^ f° r positive numbers a n > 0, and Jensen's 
inequality. The second inequality is for some C > by the strong Markov property for the 
split chain a n = S Tn and by Part (1) of Proposition 14.111 The order equality is by our earlier 
result. 

For the second term on the right side of (14.561) . the key observation is that 



tn+1 



dr g' x (S r 



Rn 



E 



(A) 



drg' x (S r 



'-jo 



/ E ds^ 00iA (s) h(s) 



hdse- XH Wh(s) 



is 0(A) for small A. The first equality is by the strong Markov property for the chain a n = S Tn 
and by Part (1) of Proposition 14.11 and the second equality is by Part (2) of Proposition 14.41 
The denominator of the rightmost expression approaches J s ds h(s) = u, and the numerator 
is a difference of terms which are 1 + 0(A). This follows since J s ds u _1 /i(s) = 1, by the 
approximation J s ds A x (s) = 1 + 0(A) in Part (5) of Proposition I4.15[ and since inserting the 
factor e~ XH ^ in the integrals will perturb the values by 0(A). 

With the above and the triangle inequality, the second term on the right side of (I4.56[) is 
smaller than 



|& a |A5E {a) [;Vt] + A5E 



(A) 



sup 

0<<<T 



L|JV t +ij 

E 

n=l 



R2n 



R% n - 



dr g' x (S r 



+ A5E (A) 



sup 

0<t<?r 



E 



R%n 



drg' x (S r ) - b x 



The first term is 0(A), since AaE^ [iVr] is bounded for A < I. However, 



E 

71=1 



R2n 



R2n- 



drg' x (S r ) - b x 



and 



E 

n=l 



Rzn-i 



drg' x (S r ) - b x 



R-2r, 



are martingales with respect to the filtration T[. Thus, we can apply the standard arguments 
to bound them: 



sup 

0<t<2 



E , 



L|JV t +ij 



dr g x (S r 



<2A*E< A > 

n=l 

< \^C^E w [Nt} 12 = 0(A^ 



R2n 

drg' x (S r ) - b x 

R2n-1 



The first inequality is Jensen's with the square function, and then Doob's maximal inequality. 
The second inequality is for some C > by the strong Markov property applied at the time 
i?2n-2 for the nth term in the sum followed by an application of Part (1) of Proposition 14.111 



Part (2): By Lemma |4.5[ (M) t is a sum of terms v\(Sn n ), and so the difference between 
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(M)t and v x N t can be written 



(M) t - v x N t = {MSrJ - vx) = V X (S R1 ) - v x (S Rl ) + EW [v x (S Rl )) + Ig> [v x (S Rl 



n=l 



2V< 



n=l 



where 5 S is the splitting of the (^-distribution at s £ E. Notice that -u A = f- E di , (s) v x (s). The 
sum on the right is martingale with respect to F[, by the same reasoning that M t is a martingale. 
Since S Rn £ Supp(z/) for n > 1, we have the standard inequalities: 



sup 



(M) t - v x N t 



< 4A 2 sup v x {s) 

sGSupp(i/) 



7V T 
T 



71=1 



< 4A2 supU A (s) + 22 AaE (A) 



A 



dsv\(s) - ( / (Jsd a (s; 



^ x 2 



<3(A 



□ 



Theorem 4.18. There exists a C > swc/i t/iat /or a// A < 1, then 



E (A) 



sup A 4 _D.i 



o<t<r 



In particular, there is convergence in probability 

sup \\iDt 

0<t<T ; 



< C. 



0. 



Proof. We will work with the split dynamics to show that 



E^ 


sup \D t \ 


= ew 


sup A 


= 0(A _ * 













Let Mt be the martingale from Lemma 14.51 We can write D t as 

A = / dr— (X P )- / dr — (X r ) 

r r R2 d.V i 

— E 



The triangle inequality gives 



Rz dV 
dr^-(X r ) 



Ri 



dx 



"S R - 

N t + 1 



JVt + l 



dx 



Mt- (4-57) 



sup | D t 



te[o,f] 



< 2E (A) 



Hi 



dr^(X r 
da; 



3E (A) 



sup 

fli<t<f 



E (A) 



sup |M t 

*e[o,?] 



. (4.58) 
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For the first term on the right side of (14.58j) . we can apply Part (2) of Proposition 14.141 to 



get 



1 dr^(X r )\] < I dj2(x, Pl z)E {x 
dx . 



(x,p,z) 



dr^-(X r 
dx 



<C f d/x(z,p)(l + log(l + b|))<C / dfi{x,p)(l+\p\), (4.59) 



where \i is the initial measure on E and \x is its splitting. By our assumption on /i, the first 
moment is finite, and hence (I4.59P is finite. 

The second term on the right side of (I4.58P is less simple, since it depends on t. Writing 



Rat. 



d Ax r 

dx 



dr ~Jz( X r) 



R N t 



dx 



' dAx r 



R N t 



dx 



then with the triangle inequality 



sup 

L °<*<x 



dx 



sup 

0<t<T 



R N t 



< E (A) 

< 2E (A) 

We can get rid of the first supremum with the inequality 



dr^(X r ) 



dx 



sup sup 

l<m<N t te[Rm,Rm+l] 



Rr, 



dx 



(4.60) 



sup sup 

l<m<NT t&[Rm,Rm + l] 
A 



R, 



dr^(X r ) <EW|V sup 

l m=1 te[B m ,R m +i] 

-t 



A 



R n 



dx 



< E w [Nt}2E { - 



(A) 



sup 

te[o,Ki] 



d Ax T 

dx 



2n 



0(A-4), (4.6i; 



where we have applied the strong Markov property at the times R m and have used that Sn m is 
distributed as v by Part (1) of Proposition 14. II By Part (1) of Proposition I4.14[ the expectation 
E~ A ^ above is bounded uniformly for A < 1. In the beginning of the proof of Lemma [4.171 we 
proved that E( a )[7Vt] = 0(A~2). Thus we have the last order equality above. 

Now for the last term in (14.581) . Applying Jensen's and then Doob's maximal inequality, we 
get the first two relations below 



E^ 


sup M t 


< E (A) 


sup \M t \ 2 


5 < 2E (A) nMT| 

Ll A 1 




L o<t<? 




L o<t<? 





= 2E (A) [(M)t] 5 = 2w|E {A) [Nt] ~\ (4.62) 

The second equality is by Lemma 14. 171 and v\ = J* s dv(s)v\(s). Since E^[Nt] = 0(\-2) and 
v\ is uniformly bounded for A < 1, the proof is complete. 

□ 
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5 Proof of Theorem 



1.2 



Now we put together the results from previous sections to prove the convergence of AsP A = P t w 
to the Ornstein-Uhlenbeck process over the interval [0, T]. 

Proof of Theorem \1.2i Theorem 14.181 contains the proof of the bound for the expectation of 
sup 0<t< T \D t \. 



Let P^''' be defined as in Lemma \3. 2 1 By Lemma [3.21 the difference P^ A> — P^ A> '' converges 



(A) 



>(A),/ 



to zero with respect to the uniform metric over the interval [0, T]. Thus we can work with P t 



(A)/ 



rather than P t [X) . Define the map Q : L°°([0,T]) ->■ L°°([0,T}) given by 



Q{h) t = ht 



h e L°°([0,T}). 



Notice that the solution p t to the Langevine equation (ll.4p has the explicit solution 

Pt = G(B' t ), 

where we have assumed po = 0. Moreover, the integral equation (13. ip for P^' 1 admits the 
closed form 



(5.1; 



p 



(A)/ 



e 2 



'^p + g(\^D T ) t + g(M(%. 



(5.2) 



By our assumption (2) of List 11.11 the moment E^ A ^[|Po|] is finite, and thus the first term 
on the right side of ( 15. 2ft converges in probability to zero as A — > 0. The random variable 



E< A > 



converges in probability to zero also, since 



sup \Q(X^D^ 

0<t<T 



< (1 + |)EW 



sup A2_D_J 

0<t<T 



0(X 



where the order equality follows by Theorem 14.181 By Lemma 13.31 converges in law 

to a standard Brownian motion B' with respect to the uniform metric. Since the map Q is 
continuous with respect to the supremum norm, Q(M^)t converges in law to the process Q (B'V 

with respect to the uniform metric. The process Q (B') t is Ornstein-Uhlenbeck, and thus P t 

converges in law to the Ornstein-Uhlenbeck process and P^' does also. 



(A)/ 



□ 



6 Miscellaneous proofs 

[Proof of Proposition 13.11 

1 9 2 

To ease the notations, we note g(q) = (2ir)~2e t. We have after the change of variables 
q = + ^±p = A±l _ p ) + \ p 

2t] f 2rj ( f°° f +Xp \ 

= XTT J dq ^ q ~ Xp \ g ^ = aTT v J q9 ^ dq + Xp J d 19(<l)j • (6- 1 ) 
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We have qg(q)dq = g(Xp), and f^ x £ dqg(q) < min(l, X\p\). By calculus, we also see that 
a (—7- g(a) + a dqg(q) has a minimum over K at 0. Hence 

1 - £ x (0) < £ x (p) < %(°) + 2? ? miD ( A IPUV) = (l + Cmin(A|p|,A 2 |p| 2 ) 



8(A + 1) ^ ' ~ AVV " (A+l) 8(A + 1 

We use exactly the same technique to estimate T>\(p). Here we find 

An / f +Xp f°° f +Xp \ 

Va{p) = (XTTp \ J x q2g ^ dq ~ AXp J x q9 ^ dq ~ x2p2 J x 9 ^ dq ) (6 ' 3) 

'^a(p)-tt^9 f +XP 9(q)dq. (6.4) 

-Ap 



A + i AVV (a + i; 

To prove the first inequality involving T>\(p), we use (16. 3p . For the first integral, we note that 
for \q\ < X\p\, one has q 2 g(q) < X 2 p 2 g(q). We also observe that there exists a constant C > 
such that \g(q) — g(0)\ < C\q\ for all q, so 

16^(0)Ap 2 2 

P A(P) = ~ ( A + 1 )2 + ( A P )■ 

The second inequality involving V\{p) is an obvious consequence of (16.4j) . 
Now, we repeat the computation for n\ (p) and we get 



n?>(p) = -^-^ ((2AV + %(Ap) + Xp(3 + AV ) £\(Q)dq^j (6.5) 



4(AV + 2) Sr/Ap 



+Ap 
Ap 

/+Ap 
^ ,(,)*. 

Using (I6.4p . we have 

0L MP? _ „ l(p) > = __±L_ + _±L_ £ M«) . 

Combining the two previous equality with (16. ip . we find that 

F(\n<\ 327 19(Xp) 16r/ 2 f +Xp f +Xp 2 2 2w w 

£a(p)Qa(p) = ( A + ^3 g MP) + (A + 1)4 J x g ( q ' q J x ( P ~ q > 9 ^ dq - 

The first and the third equalities show that 

32^(Ap) 8£ A (p) 

TaW - Qa(p) " (ATTF' 



For 



which is enough to show the upper bound for Q\(p) and the estimate for Q\(p) — n+i) 3 

the lower bound, we deduce from the preceding calculations that (A + l) 3 Q\(p) is actually a 
function of Xp, which is continuous, strictly positive and goes to infinity as Xp — >■ ±oo, hence 
has a strictly positive minimu ml not depending on A. 



2 This minimum is not at 0. 
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Finally, reasoning as above, it is easy to produce an upper bound for Ir x (p) which is a 
polynomial of degree 2m + 1 in A|p|. 

The inequalities in Part 7 concern the moments for the momentum distribution when con- 
ditioned to jump from a momentum p in one region to a momentum p' in another region. 
Both inequalities follow from the monotonic and sharply decaying form of the tails of the rates 
r\, P (p') = J\(p,p'). 



[Proof of Proposition 14.31 

We will show (1). Let (x t (x,p), p t (x,p)) be the trajectory starting from the phase-space 
point (x,p) and evolving according to the Hamiltonian H(x,p) = \p 2 + V{x). The kernel 7a 
satisfies the following closed integral equation 

poo 

T x (x,p;dx',dp') = / dt5(x t (x,p) -x',p t (x,p) -p^e-fodsii+SxiPs^p))) 
Jo 

+ r dt [ dp" ?^^^ 

Jo Jr £\{Pt{x,P)) 

By iterating this equation, we obtain a series expansion for 7a where the nth term corresponds to 
the event that n — 1 collisions occur over the time interval [0, t}. By considering the contribution 
to the transition kernel 7a due to a single collision occurring over the mean-1 exponential time 
interval, we have the following lower bound 

T\(x,p; dx',dp') 

>dxdp dp dt dt 2 / dt x —— -v —7 ; : r— jtt 

Jr Jo Jo Jo £\{Pt 1 {x,p)) 8 x [p t2 ^ tl {x tl (x,p),p >")) 

e - J *- t2 ds (l+e x (p- a (x',p'))) e - /J 2 -* 1 ds (l+£ A (p s (x tl (x,p),p"))) e - Jo 1 ds (l+e x (p s (x,p))) _ /g g) 

Let A C K. be the set of p with 2/ < |p 2 < 5/. Let c > be the minimum of the values 

. f ■ , J\(p,p') A ■ c ■ f J\(P',P) 
mi mi r— r — and mi mi — — . 

\<lp2<l,p>£A £\(p) X<lp-2<l,p'^A £\(p') 

We have defined the set A to exclude the arguments p, p' of J\ from being close, since the 
rates J\{p,p') have a zero along the line p' = p. If H(x,p), H(x',p') < I, the conservation of 
energy guarantees that Ip^x,^) < I and \p 2 _u_ t2 \{x' ,p') < I. Also, if 3/ < \{p") 2 < 4/ , then 
Pt 2 -t 1 ('x tl (x,p),p") G A, since the kinetic energy can not fluctuate by more than / through the 
Hamiltonian evolution. For all (x,p), (x',p') with H(x,p), H(x',p') < I and all A < 1, 

T\(x,p; dx' , dp') >c 2 dx'dp'( / dp"). 

J A 

Thus, we can take u = c 2 U 2 ( J A dp"). 

Now, we prove the upper bound for T\(s,ds'). Notice that for \1/ e n L°°(E), then 

||'7A( x E f ) ||oo < Halloo- I n other words, 7a is a contraction in the supremum norm. This is evident 
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from the resolvent form 7a = J °° dte t &t,\, where &t,\ are the dynamical maps for the Master 
equation (11.51) and by the inequalities 



POO 

|Ta¥|l< / die-* w^ww^k m 

Jo 



The maps &t,\ are contractive in the supremum norm, since the dynamics is driven by Hamil- 
tonian flow which preserves the supremum norm, and noise satisfying the detailed balance 
condition 

When H(s') 7^ H(s), then a collision must occur over the time interval [0, Ti] in order for 
So = s and S Tl = s'. Considering the event that the first collision occurs before time t±, then 
the strong Markov property gives the first equality below: 



%(s,ds')=E^ 



x{ti<n)T x (S il ,ds') <||£>a, s |U (6.7) 



where is the probability density of the first collision when starting from s G E. The density 
has the closed form 

POO 

Di x \s')= / dt5(x t (s)-x')j x (p t {s),p')e-ti dr£ ^(s)). 
Jo 

When H(s) > 1 + 2sup x V(x), the particle will revolve around the torus freely with speed 
\p\ > (2 + 2 sup x V(x)) 2 . Using the above form for Da . 

Di X) (s')<(sn P ^p-) rdt5(Ms)-x')e^^re x ( Pr (s)) £x{pt{s)) 
V P ,p> Sxip) / Jo 

- \p£ Sx(p) q(s,x>) J 



q(s,x') 



. ( J\{P,P') 

< I sup — . . . 

V p,p' £\\P) y V 1 _ e ~ St da q {s,a) 



where q(s,a) = 2^{H{s) — V(a))i. The two terms on the right are bounded uniformly for all 
A < 1 and s with H(s) > I. 



[Proof of Propositions 12.11 and 14.151 

Our first observation is that Ax (x, p) = A\(x, — p), £\(p) = £x(~ P) and V\(x,p) = V\(x, — p). 
So for all the inequalities we have to prove, we can assume without loss of generality p > 0. 

3 3 

Assume now that A~s < p < We note that 

V— — S- i 

H(x,tf)3 - H(x,p)* = \ r = -={p' -p)T{j/,p), 



V(x)Y+(£ + V(x)y 
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where T(p', p) := 2 2 (p + p') 



. Thus one can write 



A\{x,p) = V x {p) - / dp'{p' -p){\ - r(p,p'))J x (p,p'). 
We know from Proposition 13.11 that 

< dxy + c 2 x 2 \ P \ < c\*\p\. 



(6.1 



To estimate the integral in ( 16. 8ft . we split it into two parts: \p' — p\ > | and |p' — p| < | where 
p = ]z^p is the center of the Gaussian density in J\. The first part is actually in the Gaussian 
tail and goes to at exponential speed when A — > 0. Indeed, the quantity |r(p,p')| is bounded 
by 1 and we have after the change of variables q = ^-p' + ^i%>, 



I 

j \p— p\ 



dp'\p' - p\J\(p,p') 



it] 



(A 



q\> 



(l-A) 



dq(q — Xp) 



1 

e 2 



2tt 



(6.9) 



Since - ^ p — > 00 as A — > 0, we can bound this by, say, C'X± \p\. The remaining part \p—p\ < | 
of the integral in (I6.8P can be as large as Xp if p is not large enough for fixed A. Now we show 

3 

that this does not happen if p > X~». First, we note that the function T(p,p') is symmetric 
and it is non- decreasing in p'. More precisely, for | < p' < ^ and A small enough, 



< 



dT(p,p') 
dp' 



(p 2 + 2V(x))2(p' 2 + 2V(x))2- pp' + 2V(x) 
(p' 2 + 2V{x))* ((p 2 + 2V(z))3 + (p' 2 + 2V{x))\ 



< 



C 

p3 



(6.10) 



We also have for f < p' and A small enough 



o<i-r(p^)<i-r(|,|) = i 



< 



c 



(p 2 + 32V(x))i P 



(6.1i; 



Using (16. lip the fact p — p = and (I6.10p . we can write (with r = p' — p) 

dp'ip' - P )(i-r(p,p'))j x (p, P ') 

T)(l + X) 



cxp 



dr(p — p + r)\p — p + r \ (1 —T(p 1 p + r)) 



1 f (A+l), 



2tt 



7^(1 + A) 



cxp 



drr 2 (r(p,p + r) — T(p, p — r)) 



2 I 2 1 



0(A3p) 



0(-) + 0(A4 P ). 



3 3 

For p > A~s, the above can be bounded by CX^p. Note by the way that T(p,p+r) > T(p,p—r), 
so the corresponding integral is strictly positive. Since §^(p, p) = V(x)(p 2 + 2V(x))~^ ~ Cp~ 3 , 
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this means that the contribution of the integral in (16. 8 ft can be of the same order as T>\(p) Hp 
is not large enough for fixed A. Finally, collecting the above estimate, we have proved that for 

3 3 

A~5 < p < and A small enough, 



a , \ hp 



< C\*p. 



This is enough to show that in this case, A\(x,p) < for A small enough, so the proof of our 
first inequality is done. 

3 1 

Our previous analysis implies that for < p < , 

{A rrf = ttt + °< A, " 2 > = °< A2p2 > = °< A '>- 

£\(P) A + 1 

Using the definition of T, we have 

V X (x,p) = 2/C Aj2 (X,p) — r-r 

= n< 2) (p) - ( ^'f )2 + / dp\p> - pf (Y{p,p'f - 1) J x (p,p>). (6.12) 
^a(PJ 

From ( 16.5ft . we see that 

n< a) (p) = i + o(Ai). 

The analysis of the remaining integral in (j6. 12ft is done as before. More precisely, when \p'— p\ < 
|, we can still apply (16. lip and we have 

2(7 

o < i - r( P ,p') 2 = (i - r(p, P '))(i + r(p, j/)) < — . 

Next, we write 

/ dp\p'-pni-Vip,p'f)J x {p,p')<—^ \q-M"~ -t=. (6-13) 

i| (A + l) d iui<(i^)„ r V27T 



(A + l) 3 i k |<(i_^) 3 

Since |g — Ap| 3 < (|g| + l) 3 and p~ 2 < At, the above integral is O(A^). Finally, the integral 
in ( 16. 1 2ft over \p' — p\ > | goes to when A — > at exponential speed in A, so is also O(Aa). 
Collecting the previous estimates, we find that \V\(x,p) — 1| < CA^ and |2/C A ,2(^9 p) — 1| < CA^ 

3 3 

for A small enough and p~» < p < A~*. 

For the proof of A\~(x,p) < \T>\(p)\, we consider p > 0. Then we have to show that the 
integral in (I6.8P is nonpositive. For this, we use the monotonicity of T(p,p') and the following 
straightforward inequality (valid for all p, r > 0), 

J x (p,p + r) < J\(p,p-r). 

Then we have 

dp'{p' - p)(l -T(p,p'))J x (p,p') 

POO 

rdr(l — T(p,p + r))J\(p,p + r) + / rdr(T(p,p — r) — l)J\(p,p — r) 

Jo 

< I rdr(T(p,p — r) — T(p,p + r))J\(p,p — r) < 0. 
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In order to prove that + is bounded, it is again enough to consider the case 

p > since £\{p) = £\{—p). Moreover, in view of Proposition 13.11 and (16. 8p . all we have to do 
is to prove that the integral in (16. 8p is bounded by C + C'Xp for suitable C, C (actually, our 
analysis shows that one can take C = 0). If p < A" 1 , then (q — Xp) 2 < (\q\ + l) 2 . So we can 
do a change of variable as in (16. 9 p and we get that the integral in (16 .8p is bounded in this case. 
Now, we assume that p > A -1 . For further use, we prove a stronger result, namely that in this 
case, the integral in (16 .8p goes to as A — )• 0. Indeed, we can split this integral as before into 
\p' — P~\ > f anc ^ \p' ~ P\ < I • ^ ne \p' ~ P~\ — 2 P ar ^ * s again i n the Gaussian tail hence goes to 
as A — > (see (16. 9p ). Finally, for the \p' — p\ < | part, we use (16. lip and we get 

2 dp'\(p'-p)(l-T(p,p'))\J x (p,p')<—^-, / , dq (q-Xp) 26 



(A + l) 2 J\g\< (i_M p P 2 ^ 



Since p > A 1 , we have p 2 (g — Ap) 2 < A 2 (|g| + l) 2 , and the above integral goes to as A — > 0. 
So the proof is complete. 

Now we want to find a lower bound for -£^r for \p\ > A" 1 . It is still enough to consider 
p > 0. Then by (16. 8p and (16.4)) . we have for all p > 

< 2Ap /" j„/^„/ „ W1 ^ „,^J\(p,p') 



f d p'(p'-p){i-T{p,p')y 

1 Jr 



S x (p) ~ A + l 7 R ^ v ^ yy 5 A (p) 

A+l 



For p > X 1 , we have — ttt < — 1. Since we have just proved that the remaining integral goes 



to as A and p > A" 1 , we conclude that for A small enough, < — ~. 

In Proposition 13. 1[ we proved that Qa(p) < C£\(p). Thus, to prove the same thing for 
V\(x,p), we have to prove that there are constants C, C such that for all p > 0, both ^t^j — 

and the remaining integral in (I6.12p are bounded by C + C'Xp. First, we write T>\ — A\ = 
(V\ — A\) (T>\ + Ax)- We already proved (in the proof that + j+x is bounded) that 

T>x(p) — Ax{x,p) (which is the integral in (I6.8P ) is bounded. We also proved above and in 
Proposition 13. II that both and A £^ p ^ are bounded by ^jjj + C for some C . Regarding 
the integral in (I6.12p . we consider two cases: p < A" 1 and p > A -1 . We have 

dp\p' -p)\l-T{p,p') 2 )Jx{p,p')<--^— [ \q-Xpf~^ 



{X + lfJ^ v/2V 

If p < A -1 , then \q — Xp\ 3 < (\q\ + l) 3 , so the integral is bounded. If p > A -1 , then the integral 
over \p' — p\ > | decays exponentially when A — ?• 0, so a fortiori is bounded. Finally, when 
p > A" 1 , for the integral over \p' — p\ < |, we use (I6.13p . In this case, we have \q — Xp\ 3 p~ 2 < 
X 2 \q — Xp\(\q\ + l) 2 . So we can bound the integral by C + C'Xp. 

Now we prove the upper bound for A^(x,p). We assume p > 1. Recall that p — p = |^ so 
we can write 

= / dp'(p' - p)T(p,p')Jx(p,p') - —— / dp'T(p,p')Jx(p,p'). 
Jr a + l Jr 

But we have 

-r^r / *T(p,j/).7a(p,j/) < -^r / VlW)^,?') < / P dp'Jx(p,p')- 
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This is exponentially decreasing in p (uniformly in A). Now, we write (r = p' — p and g(q) 

1 ^ 

(2ir)~2e 2") 

2 



/ 



dp'{p' - p)T(p,p')J x (p,p' 



, 2Ap 
A+l 

< / rdr 



r/(A + l 
rdr ( T(p,p + r] 

2Xp 



2\p 



A + l 



T(jp,p — r) 



2\p 



A + l 



A + l 
9 I — r 



o 



A + l 



(r(p, p + r) — r(p, p — r)) g [ — ^— r 



A + l 



+ / r dr (r(p,p + r) — p — r)) g 

2Ap \ 2 

A+l 



In this last inequality, we used the fact that T(p, p + r) + T(p,p — r) > 0, which is true because 
T(p,p — r) > T(p, —p — r) and \T{p, —p — r)\ < T{p,p + r) for all r > 0. Now, for the first 
integral, we can use ( 16.10P and we find that it is bounded by For the second integral, we 

first observe that the integral over r > | is decaying exponentially, and for the integral over 
< r < f , we use again ( 16 . 1 j) and we get a p- bound. 

The bounds for Kx, n are straightforward since by definition of T, one has 



JC\, n (x,p) < 2-f f dp'\p-p'\ n J x (p,p') 



In the same way, we have 



1C x , n (x,p) < 2"t / -pTWI > IpI)^a(p,p')- 
Jr 

Since /C^n^, — p) = fc\, n (x,p), we may assume without loss of generality that p > 0. Writing 
r = p' — p, we have 



fCx,n(x,p) < C r 



Qf r m+1 dr# (^pr + Ap) + jf r m+1 dr# ( 



A + l 



-r — Ap 



- Cn (l°° rm+ldrg { 



2p 

*+i r ) +J T r ~«* 9 (I 



Therefore, K,x, n (x,p) is bounded by some C„ > 0. 
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A Exponential ergodicity 

In this section, we present for convenience a self-contained proof of the relaxation of our dy- 
namics to an equilibrium state. Our dynamics is driven by the forward Kolmogorov equation 

j t ^t,x(x, p) = (£* x y tjX )(x, p) = -p^ t! x(x,p) + ^(x)^ tyX (x,p) 

dp'(j x {p',p)V ttX {x,p') - Jx(p,p')%,x(x,p)), (A.l) 



which has equilibrium state \I/oo,a- The Kolmogorov equation (lA.lj) determines a transition 



semigroup <&t,\, which we take to operate on measures from the right and bounded measurable 
functions from the left: 

(li$ tyX )(ds) = n(ds')<S> t>x (s',ds) and ($ tyX g)( s ) = /$ t)A (s, ds')g(s'), 

for yU G M(E) and g G -B(E). We identify probability densities with their corresponding 
measures and denote the total variation norm for finite measures on E by || ■ || i- The exponential 
ergodicity that we prove in Theorem lA.ll is not used critically anywhere in our proofs, although 
we need some degree of ergodicity in order to make sense of certain expressions such as the 
reduced resolvent 9\ x of the function for instance. A more thorough study of the ergodicity 
would give some control of the the exponential rate a (A) appearing in Theorem IA. 1 1 for A <C 1. 
We believe that a (A) can be taken to scale as oc wjpi) f° r small A (i.e. a bit slower than 
linear in A), and if C is replaced on the right side of (1A.2I) by CH^H^ for the weighted norm 
II^IU — Jx 1^1 (dxdp) (1 + Aa|p|) , then the exponential rate scales as oc A for A <C 1. 
We denote the space of probability measures on E by M+(E). 

Theorem A.l. Let & t ,\ be the transition semigroup corresponding to the Kolmogorov equa- 
tion ( L4. There exist a(A), C > such that for all ^ G M^_(E) 7 



ll^-^cA^ <Ce~ ta ^. (A.2) 

Proof. It is sufficient to work with the resolvent chain rather than the original process and show 
that there are C, a > 

Ce~ na > - * 



sup 

ll9ll°o<l 



oo,A 1 1 j 

(*7?)(ds) g(s) -*oc,a(<7)|, (A.3) 
where 7a : -B(E) — > -B(S) is the transition operator for the resolvent chain and has the form 

POO 

T x = / dre- r $ r , x . 



JO 

There exist L > and < e < 1 such that the following statements hold: 
(i). For all s, s' G E with H(s), H(s') < L, 

T\(s,ds') > e/j,^ ds'. 



(is 



(ii). For all s G E with H(s) > L, 



Tx(s,ds') > 



H(s')<L 



In the above, lll is defined to be the Lebesgue measure of the region {H(s) < L} C E, and 
we have inserted this constant for convenience. Given any L > 0, we can find an e such that 
statement (i) is true by the argument in the proof of Part (1) of Proposition 14.31 (in which the 
cutoff was 1 = 1 + sup x V(x) rather than an arbitrary fixed L, but this does not change the 
argument). The statements (i) and (ii) imply that the probability of jumping into the region 
{s G E | H(s) < L} is at least e > from any starting point. The statements (i) and (ii) and 
the fact that ^oo.a is a stationary state for the dynamics are sufficient to prove the exponential 
ergodicity without further reference to the dynamics at hand. The proof of (ii) is below, and 
we proceed next with the remainder of the proof. 

By (i), we have the minorization condition T\(s,ds') > h{s)v{ds') for 

h(s) = ex{H(s) < L) and v{ds) = ds ^X (H(s) < L) . 

Let aGE = Ex {0, 1} be the split chain defined using the pair h, v above and n m , m > 1 be 
the sequence of return-times to the atom. The times n m form a delayed renewal chain in which 
the delay distribution is [fii = n], and the jumps have distribution P^[ni = n]. 

We will treat the difference between ^T™ and \Poo,a in the norm || ■ ||i through the formula 
(|AJ). For g E 5(E), 

J^T x n )(ds) g(s) =E^[g(a n )} = [g(S t )] 

n oo 

= ^4 A) [ x ( m ~ 1 = h r)gMx(n r+1 > n) 

m=l r=l 

n 

= Ef [g(a n ) x (ni > n)] + ^ " 1) ^ [gK-m)x(ni >n-m)], 

m=l 

where : N — > M + is the renewal function 

oo 
r=l 

We will delay the demonstration of the following two statements until the end of the proof. 
(I). The is a c > such that for all m e N and probability measures \1/ e M^_(E), 

ff[ni >m]< ce~ em . 



(II). There is a c > such that for all m G N and probability measures \1/ G M^(E), 

||4 A) H-*cx,,A(/i)||oo<ce- em . 
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With the above statements 



{*T?)(d8)g(a)-V eo M < E^[g(a n ) X (fii> 
LfJ 

+ \ F *\ m - !)| |4 A) [<?K-m)x(ni > n - 



n 



in 



m=l 



+ *oo,a(/i)| E 4 A) bK- m )x(^i>^-m)]-|^M 



m=LfJ+l 



|4 A) M 



*oo,a(/i)|e1 a) [y(«7 n _ m )x(fii > 



n — m 



)] 



+ £ 

m=LfJ+l 

< cll^lU^l - e) n + e-V 6 ^ + e- 1 e- e ^ J ^ 00iA (/i) + e"^"^ 



- 

2 J 



(A.4) 



where for the first inequality, we have applied the triangle inequality after adding and subtract- 
ing the term 

n 

*oo,x(h) E Ef ] [g{a n „ m ) X {ni >n-m)]. 

m=LfJ+l 

The second equality in (1A.4I) uses (I) and (II), that F^{m) < 1 for the second term, and the 
identity 



E ^ Wm)x{^ > m)] = 4 A) E 



m=l 



m=0 



*oo,a(/i) 



(A.5) 



for the third term. The second equality of (1A.5|) follows by Part (2) of Proposition 14.141 The 
last line of (1A.4|) is H^Hoo multiplied by 0(e 2 ) ; and so we have proven (1A.3[) for a — ~ when 
assuming statements (ii), (I), and (II). 

(ii). We must return to a consideration of the process St underlying the resolvent chain. Let 
r be a mean-1 exponential and So = s G S. For H(s) > L, let q be the hitting time that 
H(S^) < L. Since the Hamiltonian evolution preserves energy, we can give a lower bound for 
our quantity of interest as 

/ %.{., ds') > Pf) [c < r and no collisions occur over the time interval (c, r] ] 

JHts'XL 



>( inf — -r)pWf f <Tl 



(A.6) 



The second inequality uses that the collisions occur with Poisson rate £\(St) and that t — q is a 
mean-1 exponential when q < r and conditioned on the information up to time q. The escape 
rates £\(S t ) are uniformly bounded over any compact region, and it is thus sufficient for us to 
give a lower bound for P^ [<r < r] . For any T > that we choose, 

POO 

inf P (A) k<r] = inf / dt e -*P (A) \q < t] 

H(s)>L H(s)>Lj Q L J 

> e~ T - e~ T sup P^ A) [q > T] . 

H(s)>L 
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If we show 8\xp H r a \ >L [q > T] is small for large T (or even merely bounded away from one), 
then combined with (IA.6j) the proof of statement (ii) is complete. 



Define the function w : E — > [|, 1] such that 

1 

w(s) = 1 — -- 



2l + H-2(s) 

There exists L large enough so that for some 5 > 

sup / dp' Jx(p,p') iw(x,p') —w(x,p)) < —6. (A. 7) 

H(x,p)>L Jr v ' 

Even though the rate of increase for w(x,p) as \p\ — >• oo is gentle, it is still visible through 
the jump rates J\{p,p') in (1A.7j) . because the escape rates £\{p) = f R dp' \J\(p,p') grow on the 
order of \p\ by Part (1) of Proposition 13. 1[ and the landing p' for a jump from p with \p\ 3> 1 is 
concentrated around p' = j^P- In other words, when the momentum of the particle is high, a 
collision will occur quickly and the resulting momentum of the particle will typically be fraction 
of the former value. 

The inequality (1A.7|) implies that tS + w(St) behaves as a supermartingale when H(S t ) > L, 
since for s = (X t , P t ) 



d 
dr 



E^[r5 + w(X r ,P r )} = 5+[ dp'J x {P t ,p') (w(X t ,p')-w(X t ,P t )) < 0. 



The equality follows since the function w is invariant of the Hamiltonian evolution. We have 
the following sequence of inequalities 

L*TI / , llr „\L*rj 



sup Fi x) [^>T]<( sup Pi A) [^>r 1 ]) < f sup SE^k AS' 1 ] 
<( sup [w(s) - w(S^)]) l5Ti < 2-^. 



H(s)>L 

For the first inequality, the event q > T requires that the time q failed to occur over disjoint 
time intervals (n5 _1 , (n + for < n < [ST\ — 1. The second inequality is Chebyshev's 

and the third is by the optional stopping theorem, since w(s) — w(S t ) — St is submartingale 
over the time interval t E [0,q A S^ 1 ]. For the last inequality, | < w(s) < 1. We can choose T 
to make the right side arbitrarily small. 



(I). The second equality below follows from an inductive argument using the fact that the 
probability of the event a n E S x 1 is h(a n ) when conditioned on the information up to time 
n — 1 and the value <j n . This property is visible from the form of the transition operator 7a 
given in Section 14.11 

P^ A) [h x >m]= Ef [ x {n x £ 1) • • • X (ni ? m - 1)] 
= 4 A) [(1 - h(a )) ■ ■ ■ (1 - h(a m ^))] 

= E^ [(1 - h{a )) ■ ■ ■ (1 - % ra _ 2 ))EW [(1 - h{a m -t)) \a ,..., a m . 2 ]] 
< (1 - e)E { i ] [(1 - h(a )) •■■(!- h(a m _ 2 ))]. 
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The third equality identifies the expectation as from the original statistics, and the inequality 
uses that the probability o~ m -\ jumps from a m _2 to the support of h is > (1 — e) by (i) and (ii). 
Applying this inductively, we get the bound. 

(II). By the renewal theorem, F^ x \n) converges as n — > oo to the inverse of the expectation of 
the renewal increments n m +\ — n m . The expectation of the increments n m+ i — h m is given by 

hi -. 
4 A » [fw. " n m ] = 1 + E« [Sj = f « [£ l] = 

?1=0 °°' 

since the distribution for the split chain following a return to the atom is v by Part (1) of 
Proposition 14.14 an d where the second equality is by Part (1) of Proposition 14.41 The tails 
of the delay distribution [ni > n] and the renewal jump distribution P~ A ^ [ni > ra] decay 
with order 0(e~ en ) for large n by (I). It follows that the renewal function F^{n) converges 
exponentially with rate e to the value ^oo,\(h). 

□ 
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